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CROSS-REFERENCE TO RELATED APPLICATION 
This application is a continuation-in-part of 
U.S. Application Serial No. 240,768, filed September 2, 
1988 which is a continuation-in-part of U.S. Appli- 
cation Serial No. 115,139, filed October 3C, 1987, 
which disclosures are hereby incorporated herein by 
reference. 

INTRODUCTION 



Technical Field 

This invention relates to compositions and 
20 methods for preparation of novel polypeptides, in 

particular fusion polypeptides, using recombinant DNA 
techniques. 

Background 

25 The advent of genetic engineering brought with 

it the promise of easy production of large quantities 
of a variety of peptides. However, this promise has 
not been fully realized for a number of reasons. For 
example, in many instances where the peptide has been 

30 produced and retained in the cytoplasm of the host 
organism, inclusion bodies have resulted requiring 
denaturation and renaturation of the protein, frequent- 
ly with only partial or little success. In other 
instances, the peptide has been substantially degraded 

35 so that not only are yields low, but also complicated 
mixtures are obtained, which are difficult to 



separate. 
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As a potential solution to these difficulties/ 
the possibility of obtaining secretion of a desired 
peptide into the nutrient medium has been investi- 
gated. Obtaining secretion of the desired protein has 
5 met with limited success in the past, since not all 
proteins are capable of being secreted by the host 
cells which have been employed. Moreover, even when 
secreted, the processing of the peptide by the host 
cell may result in a product which differs from the 

10 composition and/or conformation of the desired poly- 
peptide and the yields of protein have been less than 
expected. There is, therefore, a substantial interest 
in developing systems for the efficient and economic 
production of active peptides where the desired poly- 

15 peptide can accumulate in the host. cell without 

degradation and can either be secreted in an active 
conformation or conveniently processed and renatured to 
a functional state. 

20 SUMMARY OF THE INVENTION 

Expression cassettes, and methods for their 
preparation and use are described, which provide for 
enhanced expression and production of an active gene 
product. The expression cassettes include efficient 

25 transcriptional and translational initiation and ter- 
mination regulatory regions appropriate for the host 
cell to provide for expression of a desired polypep- 
tide. The expression cassette preferably further 
includes, as appropriate for the host cell, a leader 

30 sequence for expression under the transcriptional and 
translational regulation of the regulatory region, 
sequences providing for enzymatic or chemical cleavage 
sites for cleavage of the leader peptide from mature 
polypeptide, and regulatory sequences which allow the 

35 time of expression of the gene of interest to be 

modulated. The expression cassettes are introduced 
into a host cell under conditions whereby the resulting 
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transformants stably maintain the expression cassette. 
Naturally occuring DNA and synthetic genes may be 
employed for the production of a polypetide of 
interest. 

5 

DESCRIPTION OF THE SPECIFIC EMBODIMENTS 
In accordance with the subject invention, ex- 
pression cassettes are provided which, when inserted 
into a host cell, allow for the preparation of a poly- 
10 peptide of interest which has enhanced stability and is 
either secreted in an active conformation or may be 
conveniently processed and renatured to an active 
state. 

To obtain increased expression of a poly- 
15 peptide of interest in a host cell, the nucleotides 

encoding the N-terminal amino acids of the polypeptide 
of interest are modified within the constraints of 
codon degeneracy to mimic those of the natural gene 
sequence found with the Shine-Dalgarno sequence used in 
20 the expression cassette. The expression cassette thus 
will have the following general structure: 

P — S.D. — met — G x 

wher.ein 

P comprises a promoter sequence including the 
25 regulatory regions occurring at about -35 and -10 

nucleotides upstream from the start of the RNA chain 
and may also include regulatory sequences allowing for 
the induction of regulation; 

S.D. comprises a Shine-Dalgarno sequence; 
30 met comprises a codon for the initiating 

methionine of the polypeptide of interest; and 

d comprises the gene for the polypeptide of 
interest wherein the first 7 to 30 codons of the gene 
have been modified wherever possible, using codon 
35 degeneracy to approximate the nucleotide sequence of 

the natural gene which would follow the Shine-Dalgarno 
sequence used in the expression cassette. 
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As an alternate means of obtaining increased 
expression of the polypeptide of interest from the host 
cell/ the polypeptide can be expressed as a fusion 
protein by including in the expression cassette a DNA 
5 sequence encoding a leader sequence peptide joined in 
reading frame upstream from the gene of interest. 
Expressing the polypeptide of interest as a fusion 
protein can result in up to 30% or more of the protein 
produced by the host cell being the polypeptide of 
10 interest. The expression cassette for expressing a 
fusion protein will thus have the following basic 
structure: 

P — S.D. — met — L — G 

wherein: 

15 P, S.D. and met have the meaning described 

above ; 

G comprises a gene for the polypeptide of 
interest; and 

L comprises a DNA sequence encoding a leader 

20 peptide which may be an N-terminal sequence from any 

bacterial or bacteriophage gene, but generally is from 
a highly expressed gene; an amino acid sequence 
containing large numbers of. hydrophobic amino acid 
residues; or an amino acid sequence containing large 

25 numbers of hydrophilic amino acid residues. When L 
comprises a hydrophobic amino acid sequence, this 
sequence will preferably also function as a signal 
sequence, allowing secretion of the polypeptide of 
interest from the host cell and cleavage of the signal 

30 sequence from the polypeptide. The DNA sequence coding 
for L may also be modified wherever possible, using 
codon degeneracy to approximate * the nucleotide sequence 
of the natural gene which would follow the Shine- 
Dalgarno sequence used in the expression cassette. 

35 The expression cassette described above 

provides for a fused expression product comprising the 
leader peptide and the polypeptide of interest. If it 
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is desired to obtain the polypeptide of interest alone 
and there is no convenient cleavage site, e.g. as 
provided by a natural signal sequence, a cleavage site 
may be provided for by joining at least one codon 
5 encoding a cleavage site in reading frame upstream from 
the gene of interest. The cassette will thus have the 
following structure: 

P — S.D. — met — L — C — G 
wherein P, S.D., met, L and G have the meaning 
10 described above and 

C comprises at least one codon providing for a 
chemical or enzymatic cleavage site. 

To stabilize the mRNA and to provide for 
higher levels of expression of a desired polypeptide, a 
15 transcriptional termination region (T) can be included 
in the expression cassette downstream from the gene of 
interest. An example of an expression cassette 
comprising T is as follows: 

P — S.D. — met — G — T 
20 although T may be included in any of the expression 
cassettes as described above. 

Construction of Expression Cassettes 

Design of an expression system to yield high 

25 levels of gene product must take into consideration not 
only the particular regions of a gene which have been 
determined to influence expression but also how these 
regions (and thus their sequences) influence each 
other. Where possible, choice of appropriate req- 

30 ulatory sequences will take into account the various 

factors which affect expression. Different genes have 
evolved a combination of all of these factors to yield 
a particular rate of expression; thus highly expressed 
genes can be considered useful models. 

35 In terms of transcriptional regulation, the 

amount and stability of messenger RNA are important 
factors which influence the expression of gene 
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products. The amount of mRNA is determined by the copy 
number of a particular gene, the relative efficiency of 
its promoter and the factors which regulate the 
promoter, such as enhancers or repressors. Initiation 
5 is believed to occur in the region just upstream of the 
beginning of the coding sequence. 

The promoter in prokaryotic cells comprises 
nucleotide sequences which can affect the efficiency of 
transcription. These sequences include the regulatory 

10 regions at about -35 -and -10 nucleotides from the start 
of the RNA chain. Efficient promoters include those in 
which the nucelotide sequence of the -35 and -10 
regulatory regions is substantially the same as con- 
sensus sequences for these regions in bacterial 

15 promoters from highly efficient genes*. Generally these 
regions are about 5 nucleotides annd 6 nucleotides, 
respectively, in length, and each sequence may vary by 
. about 1 nucleotide in length and/or in sequence. A. 
preferred sequence for the -35 consensus regulatory - 

20 sequence is from the trp promoter, namely TGACA, and 
for the -10 consensus regulatory sequence is from the 
lac promoter, namely TATAAT. 

Not only the nucleoide sequences but also the 
spacing of the ccnsensus sequences of the -35 and -10 

25 regulatory regions, with respect to each other, is 

important for obtaining optimum transcription of the 
gene of interest « Generally, the consensus sequences 
of the -35 and -10 regulatory regions are separated by 
about 16 to 18 nucleotides, preferably by about 17 

30 nucleotides. 

Ilustrative transcriptional regulatory regions 
or promoters which provide for efficient transcription 
include the B-gal promoter, lambda left and right 
promoters, the trp and lac promoters and trp-lac (tac) 

35 fusion promoters, and the like. Synthetic promoters 
having sequences substantially similar to these 
sequences may also find use. A preferred promoter is a 
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fusion promoter comprising the -35 regulatory region 
from the trp promoter and the -10 regulatory region 
from the lac promoter. Most preferably, the promoter 
is one in which the -35 trp consensus sequence is 
5 located about 17 nucleotides upstream from the -10 lac 
consensus sequence. 

The transcriptional regulatory region may 
additionally include regulatory sequences which allow 
the time of expression of the gene of interest to be 

10 modulated, for example by presence or absence of nut- 
rients or expression products in the growth medium, 
temperature, and the like. For example, expression of 
the gene of interest may be regulated by temperature of 
the host cell growth medium by including a regulatory 

15 sequence comprising the bacteriophage lambda P L pro- 
moter, the bacteriophage 0 L operator and the gene CI857 
which codes for the temperature-sensitive C x repressor 
in the expression vector. This would allow regulation 
of the promoter by interaction between the repressor 

20 and the operator at low temperatures, for example about 
30°C. Increasing the temperature to about 42°C would 
inactivate the repressor and allow expression of the 
gene of interest. 

As an example of modulation using growth- 

25 medium nutrients, regulation of the lac or the trp-lac 
hybrid promoter can be accomplished by use of the gene 
for the LacI repressor, which binds in the lac promoter 
region downstream from the -10 regulatory region. The 
LacI repressor gene may be present on an episome, pref- 

30 erably the laclq enhanced mutant, or can be included in 
the expression cassette itself. Presence of a signif- 
icant concentration of the repressor in the growth med- 
ium inhibits the promoter function in the absence of 
inducers. Thus addition of IPTG or lactose to the host 

35 cell growth medium enhances promoter function. When 

che bacterial strain is Lac + , la ::tcse may be used as an 
inducer instead cf IPTG. 
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The transcriptional regulatory region may 
additionally include regulatory sequences which 
terminate transcription and which provide sequences or 
structures which inhibit degradation of the mRNA and 
5 thus increase the stability of the mRNA species and 
allow for higher expression. Several examples of 
prokaryotic sequences are known — the trp terminator f 
the gene 32 (T4) terminator/ or synthetic terminators 
which are similar in sequence to gene 32. 
10 In terms of translational regulation, given 

the presence of mRNA, expression can be regulated by 
influencing the rate of initiation (ribosome binding to 
the mRNA) , the race of elongation (translocation of the 
ribosome across the mRNA) , the rate of post- 
15 translational modifications and the stability of the 
gene product. The rate of elongation is probably 
affected by codon usage; the use of codons for rare 
tRNA's jnay reduce the translation rate* It is 
therefore preferable to use codons which frequently 
20 appear in genes normally expressed by the host cell to 
increase the translation rate. 

Downstream from the -35 and -10 regulatory 
regions is a consensus nucleotide sequence, generally 
AGGA, termed the Shine-Dalgarno sequence, which is 
25 believed to be involved in ribosomal binding. Optimium 
ribosomal binding and initiation of translation can be 
achieved by using a ribosomal binding site functional 
in the host cell from a highly expressed gene. * 
Evidence also points to the presence of nucleotide 
30 sequences surrounding the Shine-Dalgarno sequence and 
sequences within the coding region which can affect 
ribosome binding, possibly by the formation of 
structural motifs through which the ribosome recognizes 
the initiation site, thus altering nucleotide sequences 
35 of the coding region can be used to achieve optimum 

ribosomal binding and initiation of translation. The 
sequence of the first about 7 to 30 codons after the 
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initiating codon ATG can also affect binding and 
expression. Preferably the leader sequence and the 
Shine-Dalgarno sequence are obtained from the same 
gene, or where they are obtained from different genes, 
5 the codons of the leader sequence can be modified using 
codon degeneracy to approximate the nucleotide sequence 
of the natural gene that follows the leader sequence. 

The position of the AGGA sequence with respect 
to the initiating ATG codon can influence expression. 

10 Generally the Shine-Dalgarno sequence is located from 
about 5 to 9 nucleotides from the initiating codon , 
although, unexpectedly, high levels of expression can 
be achieved using expression cassettes wherein the 
Shine-Dalgarno sequence is located from about 10 to 13 

15 nucleotides, preferably 11 to 12 nucleotides from the 
initiating codon. 

Stability of the mRNA is governed by the 
susceptibility of the mRNA to ribonuclease enzymes. In 
general, exonuclease digestion is inhibited by the 

20 presence of structual motifs at the ends of the mRNA; 

palindromic structures, altered nucleotides or specific 
nucleotide sequences. Endonuclease digestion is 
believed to occur at specific recognition sites within 
the mRNA and stable mRNA would lack these sites. There 

25 is also some evidence that mRNAs undergoing high levels 
of translation are also protected from degradation by 
the presence of ribosomes on the mRNA. 

Stability of the expression product is ^ided 
by expression of the desired gene product as a fused 

30 polypeptide in which the desired polypeptide is 

expressed in conjunction with a second polypeptide or 
fragment thereof, especially a bacterial polypeptide. 
Preferably, stability of the expression product is 
achieved by providing for synthesis of a fusion protein 

35 in which the polypeptide of interest is expressed, 

joined to a leader sequence. A DNA sequence encoding 
an N-termiaal amino acid sequence from, for example, a 
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highly expressed bacterial or bacteriophage gene such 
as the bacteriophage lambda N protein gene or cro gene 
or the B-galactosidase gene is joined upstream from and 
in reading frame with the gene of interest. The leader 
5 sequence usually includes from about 8 to about 35 , 

preferably from about 15 to about 25 f -N-terminal amino 
acids. 

Expression of the polypeptide of interest as a 
fused protein with a leader sequence from another gene 

10 has several advantages in addition to providing for 

stability. For example , the presence of the N-terminal 
amino acids provides a means for using general purif- 
ication techniques for purification of any of a variety 
of polypeptides. For example, the N-terminal amino 

15 acids of the N-protein are predictably antigenic, and 
thus specific antibodies raised against the N-terminal 
amino acids of the N-protein may be used for the amino 
purification of the fusion proteins containing the N- 
terminus of the N-protein. Furthermore, the N-terminus 

20 of the N-protein has a high positive charge, which 

facilitates purification of the desired protein by ion- 
exchange chromatography, and the like. 

The leader sequence can also be a hydrophobic 
amino acid sequence, which may additionally function as 

25 a signal sequence for secretion. A DNA sequence 

encoding the signal sequence is joined upstream from 
and in reading frame with the gene of interest. 
Typically, the signal sequence includes a cleavage site 
which is recognized by a signal sequence peptidase. 

30 Thus, positioning the polypeptide of interest directly 
after the signal sequence cleavage site will allow it 
to be specifically cleaved from the signal sequence and 
secreted as a mature polypeptide. Examples of 
hydrophobic amino acid sequences include the bacterial 

35 -alkaline phosphatase signal sequence; the OMP-A,B,C,D,E 
or F signal sequences; the LPP signal sequence, b- 
lactamase signal sequence; and toxin signal sequences. 
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Other leader sequences which can be used 
include hydrophilic sequences, for example the N- 
terminal 41 amino acid residues from amphiregulin which 
may provide for modification of the function of the 
5 polypeptide of interest. In addition, a cytotoxic 

agent such as a toxin A-chain fragment, ricin A-chain, 
snake venom growth arresting peptide, or a targeting 
molecule such as a hormone or antibody can be coupled 
covalently with the leader sequence with in most cases 

10 minimal effect on the biological activity of the gene 
product of interest. As with the other leader 
sequences, a DNA sequence encoding the leader sequence 
is joined upstream from and in reading frame with the 
gene of interest. 

15 Where the leader sequence is not a signal 

sequence or does not contain a convenient natural 
cleavage site, additional amino acids may be inserted 
between the gene of interest and the leader sequence to 
provide an enzymatic or chemical cleavage site for 

20 cleavage of the leader peptide, following purification 
of the fusion protein, to allow for subsequent purifi- 
cation of the mature polypeptide. For example, intro- 
duction of acid-labile aspartyl-proline linkages 
between the two segments of the fusion protein facili- 

25 tates their separation at low pH. This method is not 
suitable if the desired polypeptide is acid-labile. 
The fusion protein may be cleaved with, for example, 
cyanogen bromide, which is specific for the carboxy 
side of methionine residues. Positioning a methionine 

30 between the leader sequence and the desired polypeptide 
would allow for release of the desired polypeptide. 
This method is not suitable when the desired poly- 
peptide contains methionine residues. 

Where the leader sequence comprises a signal 

35 sequence, genes of interest with secretory leader 

sequences can be expressed with or without the leader 
sequence under conditions where the sequence may be 
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retained or cleaved. In addition, to obtain a high 
proportion of the desired polypeptide as a mature , 
cleaved and refolded peptide secreted into the medium, 
it is preferable to use a promoter, such as the tac 

5 promoter, which can operate at a lower temperature, for 
example about 30°C. Unexpectedly, higher levels of 
secretion can be obtained at the lower temperatures. 
Extremely high expression levels can prevent full 
translational modifications of the protein to occur, 

10 resulting in aggregation and accumulation of uncleaved 
precursor {i.e., structural protein and secretory 
leader). Similarly, growth at elevated temperatures, 
for example 42°C, also tends to result in aggregation 
and accumulation of uncleaved precursor.. 

15 The polypeptide of interest may be any 

polypeptide for which expression is desired and may be 
either homologous (derived from the host cell) or het- 
erologous (derived from a foreign source or synthetic 
DNA sequence). The polypeptide may be derived from 

20 prokaryotic sources, or eukaryotic sources, which eu- 
karyotic sources may include fungi, protists, vert- 
ebrates, invertebrates, and the like. The polypeptide 
of interest may include enzymes such as isopenicillin 
synthetase; mammalian peptides such as inter leukins, 

25 cytokines, growth factors, e.g. epidermal growth 

factor, platelet-derived growth factor, oncostatin M, 
TGF-a and -8/ viral growth factors, e.g. Vaccinia 
Virus, Shopes fibroma; snake venom growth-arresting 
peptide, brain-derivable peptides, immunoglobulins and 

30 fragments thereof, and the like. 

Where the gene of interest is to be expressed 
in a host which recognizes the natural transcriptional 
and translational regulatory regions of the desired 
gene of interest, the entire gene with its natural 5 1 

35 and 3 1 -regulatory regions may be introduced into an 

appropriate expression vector. However, where the gene 
is to be expressed in a host which recognizes the 
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natural transcriptional and translational regulatory 
regions less well , further manipulation may be 
required. The non-coding 5 '-region upstream from the 
gene of interest may be removed by endonuclease 
5 restriction, Bal 31 resection, or the like. 

Alternatively, where a convenient restriction site is 
present near the 5' -terminus of the gene of interest, 
the gene of interest may be restricted and an adapter 
employed for linking the gene of interest to the 

10 promoter region, where the adaptor provides for the 

lost nucleotides of the gene of interest. A variety of 
3 '-transcriptional regulatory regions. are known and may 
be inserted downstream from the stop codons. 

The DNA sequences encoding the polypeptide of 

15 interest can be synthesized using conventional 

techniques giving overlapping single strands which may 
be ligated together to define the desired coding 
sequences. The termini can be designed to provide 
restriction sites or one or both termini may be blunt- 

20 ended for ligation to complementary ends of an 

expression vector. For expression of the sequence an 
initiating methionine is provided. Expression vectors 
are generally available and are amply described in the 
literature. 

25 Instead of synthesizing the gene of interest, 

the gene may be isolated by various techniques. These 
include isolating mRNA from a host organism which codes 
for the polypeptide of interest, the mRNA reverse 
transcribed, the resulting single-stranded (ss) DNA 

30 used as a template to prepare double-stranded (ds) DNA 
and the ds DNA isolated. Another technique is to 
isolate a piece of the host cell genomic DNA, and using 
a probe, appropriately degenerate, comprising a region 
of the most conserved sequences in the gene encoding 

35 the polypeptide of interest, identify sequences 

encoding the polypeptide of interest in the host cell 
genome. The probe can be considerably shorter than the 
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entire sequence, but should be at least 10 , preferably 
at least 14 , more preferably at least 20 nucleotides in 
length. Longer oligonucleotides are also useful, up to 
the full length of the gene encoding the polypetide of 
5 interest. Both DNA and RNA probes can be used. 

In use, the probes are typically labeled in a 
detectable manner {for example with 32 P-labelled or 
biotinylated nucleotides) and are incubated with 
single-stranded DNA or RNA from the organism in which a 

10 gene is being sought. Hybridization is detected by 
means of the label after single-stranded and double- 
stranded (hybridized) DNA or DNA/RNA have been 
separated, typically using nitrocellulose paper. 
Hybridization techniques suitable for use with 

15 oligonucleotides are well known to those skilled in the 
art. 

Although probes are normally used with a 
detectable label that allows for easy identification, 
unlabeled oligonucleotides are also useful, both as 

20 precursors of labeled probes and for use in methods 
that provide for direct detection of DNA or DNA/RNA. 
Accordingly, the term "oligonucleotide" refers to both 
labeled and unlabeled forms. 

Once the desired DNA sequence has been 

25 obtained it may be manipulated in a variety of ways to 
provide for expression. Por example, chimeric 
polypeptide sequences may be prepared by combining gene 
fragments of a least two polypeptides having sequences 
substantially similiar to naturally occuring 

30 polypeptide chains. It is highly desirable that the 
three dimensional structure of the polypeptide be 
retained, particularly that portion of the structure 
which may be responsible for biological activity of the 
polypeptide of interest. Depending upon the source of 

35 the fragments and the length of the desired 

polypeptide, convenient restriction sites may be 
designed into the synthetic genes used to construct the 
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chimeric polypeptides. When possible the restriction 
sfte(s) leaves the amino acid sequence of the 
polypeptide unaltered. However, in some cases 
incorporation of the new restriction site(s) may yield 
5 an altered amino acid sequence without changing the 
activity of the protein. 

During the construction of the expression 
cassette, various fragments of the DNA will usually be 
cloned in an appropriate cloning vector, which allows 

10 for amplification of the DNA, modification of the DNA 
or manipulation by joining or removing of sequences, 
linkers, or the like. Normally, the vectors will be 
capable of replication in at least a relatively high 
copy number in bacteria. A number of vectors are 

15 readily available for cloning in gram-negative 

bacteria, especially E. coli , including such vectors as 
pBR322, pACYC184 , M13, Charon 4A and the like. The 
cloning vectors are characterized by having an 
efficient replication system functional in the host 

20 bacteria. 

The cloning vector will have at least one 
unique restriction site, usually a plurality of unique 
restriction sites and may also include multiple res- 
triction sites. In addition, the cloning vector will 

25 have one or more markers which provide for selection of 
transformants. The markers will normally provide 
resistance to cytotoxic agents such as antibiotics, 
heavy metals, toxins or the like, complementation of an 
auxotrophic host, or immunity to a phage. By 

30 appropriate restriction of the vector and the cassette, 
and, as appropriate, modification of the ends, by 
chewing back or filling in overhangs, to provide for 
blunt ends, by addition of linkers, by tailing, 
complementary ends can be provided for ligation and 

35 joining of the vector to the expression cassette or 
component thereof . 
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After each manipulation of the DNA in the 
development of the cassette, the plasmid will be cloned 
and isolated and, as required, the particular cassette 
component analyzed as to its sequence to ensure that 
5 the proper sequence has been obtained ♦ Depending upon 
the nature of the manipulation, the desired sequence 
may be excised from the plasmid and introduced into a 
different vector or the plasmid may be restricted and 
the expression cassette component manipulated, as 

10 appropriate. 

In some instances a shuttle vector will be 
employed where the vector is capable of replication in 
different hosts requiring different replication 
systems. This may or may not require additional 

15 markers which are functional in the two hosts* Where 

such markers are required, these can be included in the 
vector, where the plasmid containing the cassette, two 
replication systems and the marker (s) may be 
transferred from one host to another, as required. For 

20 selection, any useful marker may be used. Desirably, 

resistance to neomycin or tetracycline are of interest. 
However, although a marker for selection is highly 
desirable for convenience, other procedures for 
screening transformed cells have been described. See 

25 for example G. Reipin et al. Current Genetics (1982) 

189-193. Transformed cells may also be screened by the 
specific products they make, for example, synthesis o'f 
the desired product may be determined by immunological 
or enzymatic methods. 

30 The expression cassette may be included within 

a replication system for episomal maintenance in an 
appropriate celluar host or may be provided without a 
replication system, where it may become integrated into 
the host genome. The DNA may be introduced into the 

35 host in accordance with known techniques, such as 

transformation, using calcium phosphate-precipitated 



WO 89/03886 



17, 



PCT/US88/03872 



DNA, transfection by contacting the cells with a virus, 
microinjection of the DNA into cells , and the like. 

Once the gene of interest has been introduced 
into the appropriate host, the host may be grown to 
5 express the gene of interest. A variety of prokaryotic 
hosts may be employed. Host cells can include gram- 
negative organisms such as E. coli , e.g., JM109, JM101, 
and 107; HB101, DH1 or DH5. Particularly suitable are 
gram-positive organisms such as B. subtilis which have 

10 no periplasmic space and directly secrete polypeptides 
into the growth medium. 

The host cell may be grown to high density in 
an appropriate nutrient medium. Where the promotor is 
inducible, permissive conditions will then be employed, 

15 for example, temperature change, exhaustion or excess 
of a metabolic product or nutrient/ or the like. For 
example, where the regulatory sequence comprises the 
bacteriophage XP L promoter, the bacteriophage 0 L 
operator, and the CI857 temperature sensitive repres- 

20 sor, the host cells may be grown at the permissive 

temperature, generally about 30°C, at which temperature 
transcription from the P L promoter is repressed and the 
host cells may grow unhindered by the demands of the 
synthesis of the foreign gene product, which addition- 

25 ally may be toxic to the host organism. When the host • 
cells have reached an optimal density, the temperature 
may be increased to a non-permissive temperature, for 
example about 42°C, at which time the CI repressor is 
rendered inactive , permitting trainscription from .the P L 

30 promoter. 

Maximal secretion can be obtained by using the 
lac promoter or a trp-lac promoter and induction with a 
metabolic inducer such as lactose for a lac + host 
strain, and providing lacl^ on a vector. Examples of 
35 host cells which could be used with this system include 
DEI, DH5 or EB101. 
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Where the product is retained in the host 
cell, the cells are harvested, lysed and the product 
isolated and purified by extraction, precipitation, 
chromotography, electrophoresis, and the like. Where 
5 the product is secreted into the periplasmic space, the 
cells are harvested and the product is liberated by 
destruction of the cell wall, e.g., by hypotonic shock 
and the like. Where the product is secreted into the 
medium , the nutrient medium may be collected and the 

10 product isolated by conventional means, for example, 

affinity chromotography. To produce an active protein 
it may be necessary to allow the protein to refold. If 
the protein is expressed as a fusion protein with the 
leader sequence, the leader sequence may be removed by 

15 treatment with for example formic acid or cyanogen 
bromide. The leader sequence preferably is removed 
after refolding of the protein. 

The following examples are offered by way of 
20 illustration and not by way of limitation. 
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20 

Example II 
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A. Plasmid pBMll 
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2. Construction of pBM8 
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B. Construction of pBMllM4 

1. Construction of pBMll/M 
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F. Construction of pBM16t 

G. Construction of pBM16/NDP 

H. Construction of pBMll/PAD (also called pBMllM3/PAD) 

I. Construction of pBM14 

1. Construction of pBM12 

2. Construction of pBM13 

3. Final Construction of pBM14 

J. Plasmid pLEBam 
K. Plasmid plac/cro-B gal 
L. Plasmid ptac/cro-s gal 
R. Plasmid TacPak 
15 N. Plasmid pTCPt 
O. Plasmid pTNPt 

1. Preparation of pBM16t 

2. Preparation of the 2.8kb EcoR I- BamH I fragment 
of pBM16t/VGFa lacking the Hindlll site 

3. Preparation of the 150bp BamHI-BsmI fragment 
of pBMll/PAK 

4. Preparation of oligonucleotides TacA+ and 
TacA- 

5. Ligation and Isolation of pTNPt 



20 



25 



30 



Example III , 
Preparation of Genes of Interest . 

A. Synthetic Growth Factor Genes 

1- TGF Synthetic Oligonucleotides 

2. VGF Synthetic Oligonucleotides 

3. EGF Synthetic Oligonucleotides 

4. Assembly Growth Factor Genes 

B. Synthetic Platelet Factor 4 Gene 

C. DNA Cloning of Oncostatin M 

1. Preparation of cDNA Libraries 

35 2. Restriction Site Mapping 

3. DNA Sequence of Oncostatin M 

4. RNA Analysis 
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Example IV . 

Expression of the Polypeptide of Interest 
as a Fusion Protein with the N-Protein 

A. Modified Synthetic TGF 

1. Preparation of pBMl 1/N/TGF 
3. Modified Synthetic TGF-VGF Hybrid 

1. Preparation of pBMll/N/TTV 
C. Synthetic Platelet Factor 4 

1. Preparation of pBMll/N/PF4 



10 



Example V . 

15 Expression of the Polypeptide of Interest 

, as a Fusion Protein 
with the N-Protein and a Cleavage Site 

A. Modified Synthetic VGF 

1. Preparation of pBMll/NDP/VGFA 
20 2 « Preparation of pBMll/NDP/VGFa 

B. Modified Synthetic TGF-VGF Hybrids 

1. Preparation of pBMll/NDP/TTV 

2. Preparation of pBMll/NDP/VTV 

3. Preparation of pBM16/NDP/TW 

25 C. Synthetic EGF 

.. 1. Preparation of pBMll/NDP/EGF 

D. Synthetic Platelet Factor 4 (PF4) 
1. Preparation of pBMll/NDP/PF4 

E. Oncostatin M 

1. Construction of pBM16/NDP/OncoM 

2. Preparation of pBMX/OncoM 



30 
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Example VI , 

Expression of the Polypeptide of Interest 
As a Fusion Protein 
with the Modified Alkaline Phosphatase Signal Sequence 

5 A. Preparation of pBMll/PAD/EGP 

1. Preparation of 0.17kb EcoR I-BamHI fragment of 
EGP 

2. Preparation of 0.5kb Pvu I -Hind III fragment 
of pBMll/PAD 

3. Preparation of the 5.2kb-Pvu I-BamHI fragment 
1Q of pBMll/PAD 

4. Ligation and isolation of pBMll/PAD/EGF 

B. Preparation of pBMll/PAD/OncoM 

1. Preparation of modified Oncostatin M gene 
fragment 

2. Preparation of pBMllM3/PAD fragments 

15 3. Ligation and isolation of pBMll/PAD/OncM 

C. Preparation of pBMll/PAD/nVGPa 

1. Preparation of 0.5kb Hindlll-Pvul digested 
pBMll/PAD 

2. Preparation of the 5.2kb PvuI-BamHI pBMll plasmid 
fragment 

3. Preparation of the 170bp Ncol (blunt )-BamHI 
synthetic VGFa gene 

4. Ligation and isolation of pBMll/PAD/nVGFa 



20 



D. Preparation of pBMll/PAD/PF4 

25 Example VII , 

Expression of the Polypeptide of Interest 
as a Fusion Protein 
with the Alkaline Phosphatase Signal Sequence 

A. Preparation of pBMll/PAK/nVGFa 

30 

B. -Preparation of pBMll /PAK/EGF 

C. Preparation of TacPak/EGF 
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Example VIII . 

Expression of the Polypeptide of Interest 
as a Fusion Protein 
with the Alkaline Phosphatase Signal Sequence 
Using an Expression Cassette 
5 Comprising a Transcriptional Termination Region 

A. Preparation of pTCPt/EGF 

1. Preparation of the 420bp Hindlll ( blunt ) -BamHI 
fragment of TacPak/EGF 

2. Preparation of the 2.8kb EcoRI ( blunt ) -BamHI 
fragment of pBMl 6 1 /NDP/VGFa 

1 9 3-. Ligation and Isolation of pTCPt/EGF 

B. Preparation of pTCPt/nVGFa 

1. Preparation of the 350bp Pvu I- BamH I fragment of 
pBMl 1/P AK/nVGFa 

2. Preparation of the 2.8kb Pvu I- BamH I fragment of 
15 pTCPt/EGF 

3. Ligation and Isolation of pTCPt/nVGFa 

C. Preparation of pTNPt/EGF 

1. Preparation of 2. 8kb PvuI-BamHI pTNPt 

2. Preparation of 300bp Pvu I- BamH I fragment of 
pBMl 1/P AK/EGF 

20 3. Ligation and Isolation of pTNPt/EGF 

Example IX . 

Isolation of Recombinant Polypeptides 

25 A. Growth Factors Produced in pBM-Based Vectors Using the 
PL Promoter and the ts CI Repressor 

1. TGF and Modified TGF 
a. N/TGF 

2. Modified and Truncated VGF 



30 



a . PAD/nVGFa 

b . NDP/VGFa 

c . VGFa 

d . NDP/VGFA 
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3. Chimeric TGF/VGF Hybrids 

a, N/TTV 

b. NDP /TTV 
c • NDP/VTV 
d. NDP/TW 

B. Growth Factors Produced in Vectors Comprising the tac 
or lac Promoters 

1. PAK/EGF 

2 . PAK/nVGFa 

C. Platelet Factor 4 

1. N/PF4 

2 . NDP/PF4 

D. Oncostatin M 

1. NDP/Oncostatin M 
15 2. PAD/Oncostatin M 

Example X . 

Biological Activity of Recombinant Growth Factors 
Prepared in Prokaryotic Cells 

A. EGF Receptor Binding 
1. Receptor Binding of Chimeric Peptides 

B. Mitogenic Activity 

C. Wound Healing 

1. Mid-Dermal Injuries 

2. Mid-Dermal Donor-Graft Injuries 

Example XI . 

Biological Activity of Recombinant Platelet Factor 4 
Prepared in Prokaryotic Cells 

A. Inhibition of DNA Synthesis 

B. Inhibition of Growth of Tumors in Nude Mice 
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Example XII . 

Biological Activity of Recombinant Oncostatin M 
Prepared in Prokaryotic Cells 

A. Physicocheraical Characterization 
1 . SDS-PAGE 

B. Growth-Inhibitory Activity of Recombinant Oncostatin M 

C. Receptor Binding Activity of Recombinant Oncostatin M 
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Biological Deposits 

The following expression plasmids, all 
transformed into E. coli HB101, were deposited on the 
indicated date with the American Type Culture Collection, 
12301 Park lawn Drive, Rockville, MD 20852 and have the 
identification and ATCC Designations given below: 



10 
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IDENTIFICATION 

pBMll 
pBH14 

pBMll/PA/VGF 
pBMll/DP/VGFa 
pBMll/PA/EGF 
pBMll/M5 
PBM11/C2 
•pBMll/NDP/EGF 



ATCC DESIGNATION DATE OF DEPOSIT 



67366 
67367 
67417 
67418 
67419 
67436 
67437 
67547 



June 3, 1987 
June 3, 1987 
June 3, 1987 



October 23, 1987 



20 



Methods 



General cloning techniques were used as described 
in Maniatis et al. , 1982 "Molecular Cloning: A Laboratory 
Manual" , Cold Spring Harbor Laboratory, CSH, New York. All 
25 DNA-modifying enzymes were obtained from commercial 

suppliers. They were used according to the manufacturers 
instructions. Materials and apparatus for DNA 
purification and separation were used according to 
instructions from the supplier. 

30 

Example I. 



Activity Assays 

35 A. Mitoqenic Assay 

The mitogenesis assays were performed as 
follows: Diploid human fibroblasts obtained from explants 
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of newborn foreskin were seeded at a density of 3xl0 4 
cells/well (96-well plates , Nunclon, Roskilde, Denmark) 
and were grown to confluency in Dulbecco's modified 
Eagle's medium (GIBCO)/10% newborn calf serum. Cultures 
5 were then placed in medium containing 0.2% newborn calf 
serum, and two days later various concentrations of the 
growth factor to be tested were added. After 8 hrs/ 
cultures were labeled with 5-[ 125 I ] iodo-2 ■ -deoxyuridine 
(Amersham, 10 yCi/ml, 5 Ci/mg; 1 Ci = 37 GBq) , and the 
10 amount of isotope incorporated into TCA insoluble material 
' was determined as described (Twardzik et al. , Proc. Natl. 
Acad. Sci. USA (1985) 182:5300-5304). 

B. Soft Agar Colony Growth Stimulation Assay 

15 A 0.5 ml base layer of 0.5% agar fAgar Noble; 

Difco Laboratories, Detroit, Michigan) in growth medium 
was added to 24-well Costar tissue cluture plates. One- 
half ml 0.3% agar in growth medium containing 1 to 1.5xl0 4 
cells/ml NRK cells or other cell line of interest and 

20 various concentrations of the factor to be tested was 
overlaid on the base layer of agar. The plates' were 
incubated at 37°C in a humidified atmosphere of 5% C0 2 in 
air and refed after 7 days by addition of 0.5 ml of 0.3% 
agar in growth medium and containing the same concen- 

25 tration of the factor to be tested. Colonies were counted 
unfixed and unstained. The number of colonies with 
greater than 6 cells were scored. 

C. EGF Receptor Binding Inhibition Assay 

30 The radioreceptor assays were performed as 

follows: The binding of 125 I-labeled growth factor to its 
receptor on monolayers of a target cells was modified from 
the procedure described by Cohen and Carpenter, Proc. 
Natl. Acad. Sci. USA (1975) 72:1317-1321. Cells (IxlO 3 

35 per well) were fixed on 24-well plates (Linbro, Flow 
Laboratories) with 10% formalin in phosphate-buffered 
saline prior to as^ay. Formalin-fixed cells do not slough 
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off plates as easily as do unfixed cells, and replicate 
values were thus more consistent* Growth factor 
concentrations are expressed as ng equivalents of the 
native growth factor/ml i.e., the amount required to 
5 produce an inhibition of 125 I growth factor binding 

equivalent to that produced by a known amount of native 
growth factor. 

D. Wound Healing 

10 1. Mid-dermal Thermal Injuries 

Mid-dermal thermal injuries were made on the 
dorsal thorax of anesthetized female Yorkshire pigs (30 
lbs) whose backs had been shaved and depilatated with 
commercial hair remover cream. A brass template (3x3 cm, 

15 147 gm) was equilibrated in a 70 °C water bath and placed 

in firm contact with the skin for exactly 10 seconds. The 
resulting blister was then removed. Five mid-dermal burns 
were placed on each slide of the spine and were separated 
from each other by approximately one inch. Burns were 

20 treated twice a day with approximately 3 ml of vehicle 
cream (Silvadene*) alone or containing growth factor or 
were untreated. After 9 days or 10 days of treatment r the 
pigs were anesthetized and eschar was removed from the 
burns. Biopsies were taken of each burn f rom. re- 

25 epithelialized areas. 

2. Mid-dermal Donor Graft Injuries 

A 5-month-old 20.5 kg micropig was anesthetized 
with 20 mg/kg ketamine and 2 mg/kg Rompum. The dorsal 
thorax was shaved, prepped with betadine and thoroughly 

30 rinsed with saline. A series of 6 5x5 cm donor sites were 
made on each side of the dorsal thorax with a Padgett 
dermatome at 650/1000 inch by taking two swipes at 30/1000 
inch. Topical therapy included 1 ml of saline in 20 gm of 
Silvadene® distributed evenly between the six wounds on 

35 the left side. The right side was treated with 1 ml of 
the growth factor to be tested, in 20 gm of Silvadene® 
divided evenly between the six wounds. All wounds were 
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covered with a large burn dressing, chux, ace wrap and 
gerkin. The animal was anesthetized as described above on 
post-operative days 1, 2, 3, 4, 7, 8, 9, 10 and 11. The 
dressings were removed. The wounds were gently wiped with 
5 betadine and thoroughly rinsed with saline. The 
appropriate agent was applied and the wounds were 
redressed as described above* 

E. Inhibition of DNA synthesis 

10 On day 2 in the morning A549 cells (human lung 

carcinoma) in Nunc 96-well plates (Kamstrupvej 90. DK- 
4,000, Roskilde, Denmark) were set up. These cells 
were passaged when there were fewer than 30. Into all 
but the peripheral wells was introduced 4 x 10 3 cells/ 

15 50 yl/well (9 x 10 4 cells/ml assay medium (DMEM) with 
10% FCS, P/S, glutamine). The peripheral wells re- 
ceived 50 pi PBS and the entire plate was incubated at 
37°C. In the afternoon, the test compounds were resus- 
pended in assay medium. All compounds were tested in 

20 triplicate. Into each test well was delivered 50 yl of 
test compound in assay medium, while control wells re- 
ceived 50 yl assay medium alone. Each plate was then 
incubated at 37 °C for 3 days. On day 4, into each well 
50 yl of a solution of 125 I-iodo-2 1 -deoxyuridine (4 Ci/ 

25 mg to 0.5 mCi/ml) (1 yl isotope/ml assay medium) was 
added and the plates incubated at 37°C overnight. On 
day 5, the medium was aspirated from the wells, and the 
wells washed IX with PBS. One hundred microliters of 
methanol were added for 10 min at room temperature. 

30 The methanol was aspirated and 200 yl of 1 M sodium 

hydroxide were added to each well. The plate was in- 
cubated for 30 min at 37°C, and the sodium hydroxide 
removed with Titertek plugs (Plow Labs). The plugs 
were then counted in a gamma counter. 



35 



WO 89/03886 



30. 



PCT/US88/03872 



F. Inhibition of Tumor Growth in Nude Mice 

Male nude mice ( Balb/c-nu+/nu+ ) were supplied \ 
by the Fred Hutchinson Cancer Research Center, Seattle , 
WA. At 12 weeks of age, mice were given injections 

5 (s.c. in the neck region with approximately 1.3 x 10 s 
human lung carcinoma cells (A549} in a volume of 0.2 ml 
of phosphate-buffered saline. Palpable tumors (approx. 
10 mnr*) usually developed in 20 days. Each group 
contained 5 animals. Animals were injected every two 

0 or three days at the tumor site with 0.1 ml of PBS 
(control group) or test sample (1.2 yg/injection) 
resuspended in 0.1 ml of PBS. Day one post-treatment 
corresponds to the first day animals were injected at 
the tumor site with test compounds. Tumor size was 

5 measured before subsequent injection on the days 

indicated and represents the average size of tumor in 
each animal in the group. 

Example II. 

0 Construction of Cloning and Expression Plasmids 

A. Plasmid pBMll 

Plasmid pBMll contains the nucleotide sequence 
coding for the first 33 N-terminal amino acids of the 

5 bacteriophage lambda N protein. It also contains the 
neomycin resistance gene as a selective marker and a 
unique BamE I site for cloning of a foreign gene 
downstream from the lambda PL promotor. pBMll was 
constructed as follows. 

0 1. Construction of pBM4 

A 2.4 kb DNA fragment containing the gene 
sequences coding for bacteriophage lambda cl was 
isolated by digesting (cleaving) 120 ug of lambda 
CI857S7 DNA (New England BioLabs) with 170 units of Bgl 

5 II (BRL) restriction enzyme by incubating the mixture 
for 2 hr. at 37°C. The digestion mixture was subjected 
to electrophoresis on a 1% preparative agarose gel 
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using techniques which are conventional in recombinant 
DNA technology. The 2.4 kb fragment (base pair 35771 
to 38103; the numbering is based on Daniel, et al . , pg. 
519 in Lambda II, ed. by Hendrix, Roberts, Stahl, and 
5 Weisber,) was excised from the gel and subjected to 
electroelution at 100 V for 4 hr. at room 
temperature. The resulting eluate was recovered, 
concentrated, and extracted 3 times with an equal 
volume of phenol: chloroform (1:1). DNA was recovered 

10 from the aqueous phase by ethanol precipitation in the 
presence of 1/10 volume of 3M sodium acetate, pH 5.2. 
The recovery of the fragment was analyzed by agarose 
gel electrophoresis 

Plasmid pPL-lambda (5ug, Pharmacia) (a pBR322 

15 derivative containing the lambda leftward promoter P L , 
the lambda N gene and the termination site for 
transcription of N (T L ) and 2 BamHI sites) was digested 
with 5 units of BamHI by incubating the mixture for 20 
min. at 37°C. The digested DNA was separated by 

20 agarose-electrophoresis and the full-length DNA was 

recovered by electroelution using techniques which are 
conventional in the art. 

The linearized pP L -Lambda was treated with 
alkaline phosphatase to remove the 5" phosphates and 

25 ligated to the 2.4 kb Bglll fragment of CI857S7 (3 pi, 
about 250 ng), in the presence of ligase buffer (500raM 
Tris-HCl, pH 7.8, 100 mM MgCl 2 # 200 mM DTT, 10 mM ATP), 
and T4 DNA ligase (BRL). The resulting reaction 
mixture was incubated at 12°C for about 15 hr then used 

30 directly for transformation into competent coli 
HB101 cells. 

E. coli cells were made competent by a 
modification of a procedure described by Hanahan, J. 
Mol . Biol . , (1983) 166: 557-580. A saturated culture 

35 of the EB101 cells was diluted 1:200 in Luria Broth 

supplemented with 20 mM KgCl 2 and incubated at 37 °C in 
a gyratory water bath until the culture reached an 
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optical density (OD) nm A 550 , of 0.3. A 20 ml portion 
of the culture was harvested by centrifugation at 
4°C. The resulting pellet was resuspended in 4 ml of 
ice cold 50 mH MnCl 2 /20 mM potassium acetate, pH 6.0 
5 and kept on ice for 15 min. The cells were centrifuged 
and the pellet was resuspended in 1.4 ml of a solution 
of 10 mM potassium methanesulf onate , pE 6.2, 100 mM of 
potassium chloride, 45 mM MnCl 2 -4^0, 10mMCaCl 2 , 3 mM 
hexamine C0CI3, 100 yl DMSQ, and 100 yl 1M DTT. 

10 Three hundred yl of the treated cells were 

added to 4 yl and 6 yl of the ligation mixture. The 
treated cells were placed on ice for 30 min., incubated 
at 42°C for 90 sec, and then placed on ice again for 
90 sec. The cells were then plated on L-Broth agar 

15 plates, supplemented with 100 yg/ml of ampicillin. As 
there is no rapid and convenient method to screen for 
bacterial transf ormants containing the insert, the 
transf ormants were screened by isolating the various 
DNA by the rapid plasmid DNA isolation procedure of 

20 Holmes and Quigley Anal, Biochem. , (1981) 114 , 193- 
198. Plasmid DNA preparations from recombinant 
bacteria which migrate with the "correct" size were 
further analyzed by restriction map analysis to 
determine the orientation of the insert. 

25 2. Construction of pBM8 

Plasmid pBR322 (5yg) was digested with Psti 
(3ul, New England BioLabs) followed by 3 successive 
extractions using a 1:1 (v:v) phenol: chloroform 
solution followed by 2 ether extractions. The DNA was 

30 precipitated completeness of digestion was analyzed by 
electrophoresis on 0.8% agarose gel. The Psti digested 
pBR322 (10 yl) was treated with T4 polymerase (1 yl, 
BR1) to convert the 3' protruding ends to blunt ends, 
A Hin di I I site was introduced at the Pst i site of 

35 pBR322 by ligation of a synthetic phosphorylated 

Hindlll* linker ( 5 ' -d [ CAAGCTTG ] ) (Pharmacia). The 
resulting DNA was digested with excess Hindlll. Two 
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DNA fragments, a 3.58 Kb fragment and a 782 bp fragment 
were obtained. The 3.58 Kb fragment was isolated, 
ligated, and transformed into competent E. coli 
HB101. The transformants were analyzed as described 
5 above by sizing the Hindi I I digested DNA on agarose 
gels. 

3 . Construction of pBM9 

Plasmid pBM8 was digested with Hindlll and 
BamH I to yield 2 fragments of" DNA. The larger 

10 fragment, 3.23 Kb, was isolated by agarose gel 
electrophoresis and recovered by electroelution. 
Plasmid pNeo (Pharmacia) was digested with Hindlll and 
BamH I to yield a 1.5 Kb fragment containing the gene 
sequences coding for neomycin resistance. The 1.5 Kb 

15 ■ fragment was isolated and purified, then ligated to the 
3.23 Kb fragment from pBM8. The resulting plasmid, 
pBM9, was transformed into competent E. coli HB101 and 
the transformants were screened as described 
previously. 

20 4. Construction of pBMlO 

Plasmid pBm9 was digested with Nde l (New 
England BioLabs). The 5* protruding end was converted 
to blunt end using E. coli DNA Polymerase "Klenow" 
fragment and ligated to a synthetic phosphorylated 

25 EcoR I linker ( 5 ' -d [ GGAATTCC ] -3 ' ) at the filled in Nde l 
site. The resulting plasmid, pBMlO, an Nde l site and 
gained an EcoR I site pBMlO was transformed into 
competent E. coli HB101 and the transformants were 
screened as described previously. 

30 '5. Final Construction of pBMll 

Plasmid pBMlO was digested to completion with 
EcoR I and BamH I. The resulting 2.8 Kb fragment 
containing the neomycin gene and origin of replication 
was isolated by agarose gel electrophoresis and 

35 recovered by electroelution. Plasmid pBM4 (described 
above) was digested to completion with EcoRI and 
BamHI . The 2.84 Kb fragment containing the DNA 
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sequences for the cl gene and lambda P L promoter and 
the N gene ribosomal binding site was isolated and 
recovered. It was then ligated to the 2.84 Kb pBM4 
fragment to form pBMll. Plasmid pBMll transformed into 
5 competent E. coli HB101 and the transf ormants obtained 
were screened as described previously. The cells 
containing pBMll were screened further by restriction 
enzyme analysis with BamH I. 

10 B. Construction of pBMHM4 

This plasmid was derived from pBMll and allows 
a foreign gene to be cloned at a BamH I restriction site 
directly after the initiating methionine of the N 
gene. Plasmid pBMll/M4 also contains an Ncol site in 

15 the neomycin gene. It was constructed as follows: 
1. Construction of pBMll/M 

Plasmid pBMll (20 ug) was cleaved with BamH I 
and Pvul. After complete digestion, the DNA was 
electrophoresed through an 0.8% agarose gel and the 

20 large fragment (5124 bp) was isolated and recovered by 
elect roelution. A second sample of pBMll (10 ug) was 
digested to completion with Sph I and the 3' protruding 
ends converted to blunt ends with T4 DNA polymerase. 
0.3 ug of each of the two fragments were mixed together 

25 with 37.5 pmol of the phosophorylated synthetic 
oligodeoxynucleotide (Pharmacia P-L Biochemical) 

5* AGGAGAATTCATATGGATCCACAA 3' 

30 containing the restriction sites EcoR I , Ndel and 

BamH I. The resultant plasmid was designated pBMll/M. 
The plasmid pBMll/M was heated at 100 °C for 3 minutes, 
then sequentially cooled as follows: 30°C for 30 
minutes, 4°C for 30 minutes, and 0°C for 10 minutes. 

35 The reannealed DNA was subsequently treated with T4 DNA 
ligase and transformed into E. coli HB101. Neomycin- 
resistant transf ormants were screened for plasmids 
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containing the three new restriction sites encoded by 
the synthetic olignucleotide. 

2. Construction of pBMll/Ml 

Plasmid pBMll/M was digested to completion 
5 with Bam HI then electrophoresed through a 0.7% agarose 
gel. The large fragment (5554 bp) was isolated and 
recovered by electroelution. The resultant plasmid/ 
which lacks the 100 bp region of the lambda N gene 
downstream from the ribosomal binding the translation 
10 initiation site, was designated pBMll/Ml. Plasmid 

pBMll/Ml was religated and transformed into competent 
E. coli HB101. 

3. Construction of pBMll/M2 

A first sample of plasmid pBMll (0.3 ug) was 
15 cleaved .with Bam HI and Pvu l. A second sample of 

plasmid pBMll (0.3 ug) was cleaved with Sph I and then 
treated with T4 polymerase. The two samples were 
combined with 3.75 pmol of the phosphorylated synthetic 
oligodeoxynucleotide (Pharmacia P-L Biochemical) 



20 



5* AGGAGAATCCAGATGGATCCACAA 3' 



which contains only a BamHI. The mixture was heated 
and cooled as described previously, and treated with T4 

25 DNA ligase and E. coli DNA polymerase "Klenow" 

fragment. The resultant plasmid, designated pBMll/M2, 
was transformed into competent E. coli= HB101. 
Neomycin-resistent colonies were screened for plasmids 
containing the new BamH I site encoded by the synthetic 

30 oligodeoxynucleotide • 

4. Construction of pBMll/M3 

Plasmid pBMll/M2 was digested to completion 
with BamH I. The cleaved DNA was electrophoresed 
through a 0.7% agarose gel and the large fragment (5554 

35 bp) isolated and recovered by electroelution. The DNA 
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religated and designated pBMll/M2 transformed into 
competent E. coli HB101. 

5. Final Construction of pBMll/M4 

Plasmid pBMll/Ml (20 ug) was digested to 
5 completion with EcoRI and BamH I, then electrophoresed 
through a 0.7% agrose gel. The large fragment (5544 
bp) was isolated and recovered by electroelution. This 
DNA was ligated to the following pair of phosphorylated 
synthetic oligodeoxynucleotides (200pmol each): 

0 

5' AATTCCCATGGGG 3' 

and 

3' GGGTACCCCTAG 5' 

5 

which contains a Ncol site. The mixture was heated to 
65°C and allowed to cool to 25°C in order to anneal. 
The 'resultant plasmid designated pBMll/M4, was 
transformed into competent E. coli HB101. Neomycin 
0 resistant colonies were screened for plasmids 
containing a new Nco l site. 

C. Construction of pBMll/MS 

Plasmid pBMll/M5 was derived from pBMll. An 
5 Nco l site present in the neomycin resistant gene has 
been removed by self directed mutagenesis. Plasmid 
pBMll/MS is identical to pBMll/M4 except that the Nco l 
site in the neomycin gene has been removed by site- 
specific mutagenesis. Cloning of foreign genes into 
0 pBMll/M5 therefore does not require partial digestion 
of the vector with Nco l. 

D. Construction of pBMll/C Series 

A pBMll plasmid containing the E. coli 
15 consensus ribosomal* binding site was produced and 
designated "pBMll/C." Plasmid pBMll, 30 ug, was 
digested with 80 ug Pvul and 192 units BamH I . The 
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15 



resulting two fragments (S.lkb and 0.53kb) were 
separated by gel electrophoresis and isolated by 
electroelution. The 0.53kb fragment was then digested 
with Hae lll. The resulting fragments were separated by 
5 gel electrophoresis and the 324bp fragment was isolated 
by electroelution. The construct pBMllC, the S.lkb, 
the 324bp fragments isolated from plasmid pBMll and a 
phosphorylated chemically synthesized linker , 

10 5 1 GTAAGGAGGTTTAATATTATG 3' 

3 • CATTCCTCCAAATTATAATACCTAG 5 ' , 

were ligated together. The plasmid thus constructed is 
termed pBMll/C. Plasmid pBMll/C has restriction site 
BamH I as the cloning site as well as a unique Ssp l 
restriction site within the spacer region of the 
ribosomal binding site. It has been reported that 
spacer length affects the efficiency of protein 
translation. The presence of the Ssp l site allows the 
length of the spacer region to be changed and also 
allows for the insertion of other cloning sites for 
other genes to be expressed. 

Modification of plasmid pBMll/C has also been 
performed. pBMll/C, 10 ug, was digested with 
restriction enzyme Ssp l, 45U, and the S'phosphate was 
removed by phosphatase. This DNA was ligated to a 
phosphorylated synthetic Ncol linker CCATGG. The 
resulting plasmid, termed pBMllC/1/ contains 2 Nco l 
sites, one in the neomycin gene and one for the cloning 
of the foreign gene(s) . To avoid performing Nco l 
partial digestion of pBMll/Cl in the cloning of foreign 
gene(s), the Nco l restriction site in the neomycin gene 
was removed by site-specific mutagensis using 
techniques described previously. This plasmid is 
35 termed pBMll/C2. 



20 



25 



30 
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E. Construction of pBMll/NDP 

Plasmid pBMll/NDP f derived from plasmid pBMll, 
contains the nucleotide sequences coding for the first 
32 amino acids of the lambda N protein followed by 
5 nucleotide sequences coding for the acid cleavable 

aspartic acid-proline dipeptide. The plasmid contains 
an Ncol site and a Clal site for cloning a foreign gene 
downstream of the P L promotor. 

10 P. Plasmid pBM16t 

This plasmid is identical to pBMll/NDP except 
that it lacks the Nco l site in the neomycin gene as 
described above (Example II. P.) and it contains the 
transcription. terminator as described for plasmid pTNPt 

15 (see Example (II.N. for terminator sequence). 

G. Construction of Plasmid pBM16/NDP 

Plasmid pBMll/NDP/VGFa (see Example V.A.I, 
below) was digested with Nco l and BamH I which removes 

20 the synthetic gene TW from the % pBM16/DP plasmid 

fragment. The pBM16/NDP NcoI-BamHI 5.5 kb plasmid 
fragment was then gel purified. The Nco l site in this 
fragment is positioned downstream of the nucleotide 
sequences coding for the first 32 amino acids of the N- 

25 gene and directly after the sequences coding for the 
acid labile dipeptide Asp-Pro. 

H. Construction of pBMll/PAD (also called pBHllM3PAD ) 

This plasmid is derived from pBMll/M3 and 
30 allows a foreign gene to be cloned at a Hin dlll, Sma l 
or BamH I site downstream from a modified alkaline 
phosphatase signal sequence. Synthetic oligonucleo- 
tides were designed to allow insertion of DNA coding 
for a modified alkaline phosphatase signal peptide and 
35 a linker region with 3 cloning sites ( Hin dIII y Sma l and 
BamH I) into the pBMll expression vector downstream from 
the P L promotor and N gene ribosomal binding site. The 
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nucleotide sequence was optimized to be as similar as 
possible to the nucleotide sequence of the amino 
terminus of the lambda N gene as the lambda N gene 
sequence has evolved with that of its ribosomal binding 
5 site for efficient ribosome initiation and 
translation. 

The sequences of the oligonucleotides coding 
for the signal sequence are shown below. 

10 

PA1 5 1 GATCAATCTACAATCGCCCTCGCACTTCTCCCACTGCTGTTCACTCC 
AGTGACAAAAGCTTCCCGGG 3' 

PA2* 5' GATCCCGGGAAGCTTTTGTCACTGGAGTGAACAGCAGTGGGAGAAGT 
GCGAGGGCGATTGTAGATT 3' 

15 

Oligonucleotides Al and A2 were synthesized on a 
Applied Biosystems Oligonucleotide Synthesizer and 
purified on an acrylamide gel. The oligonucleotides 

20 were phosphorylated at the 5' end using T4 

polynucleotide kinase and then annealed to each other 
yielding a double stranded 0.067kb DNA fragment with a 
BamH I overhang at each end. 

pBMll/PAD was constructed as follows: 

25 Plasmid pBMll/M3 (20ug) was digested with 3V 

units of BamH I to linearize the plasmid directly after 
the N gene ribosomal binding site and the ATG codon for 
the initiating methionine. The 5' phosphates were 
removed by digestion with calf intestinal alkaline 

30 

phosphatase. 

The 0.067kb PA oligonucleotide fragment was 
ligated to the linearized pBMll/M3 plasmid and the 
resulting DNA was used to transform competent E. coli 
HB101. The transf ormants were screened by nucleotide 
sequencing, and a correct construct was isolated. 
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I. Construction of pBM14 

1. Construction of pBM12 

Plasmid pBR322 was digested with restriction 
enzyme Pstl. The resulting 3' protruding end was 
5 converted to a blunt end by treating the digestion 
product with T4 DNA polymerase according to 
conventional techniques. Synthetic Bgl ll linker (5 1 - 
d [ CAGATCTG ] ) was phosphorylated and ligated onto the 
- blunt end DNA and used to transform competent E. coli 

10 HB101. Plasmid DNA preparations were prepared from 

tetracycline-resistant transf ormants using conventional 
techniques digested with Pst l and Bgl ll to screen for 
loss of the Pst l restriction site and addition of the 
Bgl ll restriction site. 

15 2. Construction of pBM13 

PI amid pBM4, obtained as described above, was 
digested with Hpa l. The 3' protruding end was 
converted to a blunt end and the phosphorylated Bgl ll 
linker added to the blunt end. The ligation reaction 

20 mixture was used to transform competent E. coli HB101. 
3. Final Construction of pBM14 

Plasmid pBM12 was digested with Bgl ll and 
EcoRI. The resulting 3.6 Kb fragment containing the 
tetracycline gene and origin of replication was 

25 isolated and recovered. Plasmid pBM13 was digested 

with Bgl ll and EcoR I and the resulting 2.8 Kb fragment 
containing the bacteriophage lambda cI857 DNA sequences 
and the lambda P L and the N ribosomal binding site was 
isolated and recovered. The 3.6 Kb fragment front pBM12 

30 and 2.8 Kb fragment from pBM13 were ligated to form 
plasmid pBM14. The resulting ligation mixture was 
transformed into E. coli HB101 and the resulting 
transf ormants screened for pBM14. The putative pBM14 
was analyzed further by restriction digest using 

35 conventional techniques. 
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J. Plasmid pLEBam 

Plasmid pLEBam was used to clone synthetic 
oligonucleotide fragments because of its convenient 
Bss HII and BamHI restriction sites. A plasmid with 
S Ncol and BamH I restriction sites such as pBMll or 

pBMll/NDP (described below) can be used for cloning the 
synthetic nucleotide fragments. 

K. Plasmid plac/cro-8 gal . 

10 The controlling elements of the vector 

plac/cro-s gal consist of the operator-promoter region 
of E. coli lactose ( lac ) operon, as well as the 
ribosome-binding sites of lac and cro. This vector is 
derived from plasmids pTR213 (Roberts et al. , Proc> 

15 Natl. Acad. Sci. USA (1978) 76:760) and pLG300 
(Guarante et al./ Cell (1980) 20:543). 

Plasmid plac/cro-B gal was constructed by 
ligating a 0.96 kb Pst l- BqI II fragment from pTR213 and 
a 5.54 kb PstI -BamHI fragment from pLG300 in the 

20 presence of the oligonucleotide linker which had been 
digested with BamH I and Bgl ll: 

AAAGATCTCAGGCCTCGAGGATCC 
TTTCTAGAGTCCGGATCTCCTAGG 



25 



30 



35 



This linker served the following purposes: 
(1) to regenerate the Bgl ll and BamHI sites from the 
parental plasmids , (2) to provide additional sites for 
the insertion of foreign DNA r and, (3) to allow the 
inserted DNA to be in the correct translational reading 
f rames with respect to the cro 5 '-gal coding sequence. 

L. Plasmid ptac/cro-sgal 

Plasmid ptac/cro-egal allows a foreign gene to 
be cloned downstream of the N-terminal 21 amino acids 
of the bacterial Cro protein. It was constructed by 
inserting a 0.87 kb Rsa l fragment of the piac/cro-agal 
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plasmid into pDR540 (Pharmacia) at the BamH I site, 
which was previously converted to blunt ends by the 
action of Klenow enzyme. The orientation of the 
inserted DNA was such that the ribosomal binding site 
5 and the coding sequence of Cro were located downstream 
from the ribosomal binding site of lac . The resulting 
plasmid, ptac/cro, contained ribosomal binding sites of 
both lac and cro , and the N-ter#minal coding sequences 
of Cro. 

10 The second step in the construction of 

ptac/cro-egal was to ligate the 1.16 kb and the 5.54 kb 
PstI -BamHI fragments from ptac/cro and pLG400 plasmids, 
respectively. Expression vector ptac/cro-sgal is thus 
similar to plac/cro-sgal, with the exception that the 

15 promoter of ptac/cro-sgal consists of the -35 region 
from the promoter of the tryptophan operon and the 
Pribnow box (-10 region) of the lac operon. This 
hybrid promoter allows a higher level o£ expression 
than plac/cro-sgal. 

20 

M. TacPak 

For preparation see Example VII c. 

N. Plasmid pTCPt 

25 This plasmid is designed to have the tac 

promoter elements and utilize the cro SD to express the 
gene of interest behind the alkaline phosphatase signal 
sequence. An example of the construction of this 
plasmid is given below in the construction of 

30 pTCPt/EGF. 

O. Construction of pTNPt ( rtrp-35]17bp[lac-10] [nSD] 
8bp[ATG] /alkaline phosphatase signal/linker/trans . 
term.-NEO ) 

35 This plasmid is designed to have the tac 

promoter elements and utilize the N-gene SD to express 
a given gene behind the alkaline phosphatase signal 
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sequence. It has a pBR322 background with the Neomycin 
resistance gene. Plasmid pTNPt was constructed as 
follows : 

1. Preparation of the 2.8kb EcoRI-BamHI fragment 
5 of pBM16t/VGFa lacking the Hindlll site 

Plasmid pBM16t/VGFa was digested with EcoRI 
and BamH I and the 2.8kb fragment was isolated. The 
2.8kb fragment was ligated to an EcoR I- BamH I linker and 
•a correct construct was isolated by restriction 
10 analysis and is referred to as Intermediate I. 

The unique Hindi I I site near the Neomycin 
resistance gene was removed from the Intermediate I 
plasmid by digestion with Hin dlll/ creating blunt ends 
using Klenow fragment, and religating. This resulted 
15 in Intermediate II plasmid which lacked the Hindlll 
site. 

The 2.8kb EcoRI -BamHI fragment of pBM16t/VGPa 
lacking the Hin dlll site was isolated by digesting 
plntermediate II with EcoR I and BamHI. The resulting 
20 2.8kb fragment was isolated by agarose gel electrophor- 
esis • 

2. Preparation of the ISObp BamHI-BsmI fragment 
of pBMll/PAK 

Plasmid pBMll/PAK is identical to 
25 pBMll/PAK/EGF except that it contains a linker region 
with Hin dlll/ Smal and BamH I sites downstream of the 
alkaline phosphatase signal sequence instead of the EGP 
gene. pBMll/PAK was digested with Bsm I and BamH I and 
the 150bp fragment containing the N-gene SD, the 
30 alkaline phosphatase signal sequence and the linker 
region was isolated. 

3 . Preparation of oligonucleotides TacA+ and 
TacA- 

Oligonucleotides TacA+ and TacA- were 
35 synthesized on an Applied Biosystems Oligonucleotide 

synthesizer and were designed to have an EcoRI overhang 
at the 5' end with the trp-35 consensus sequence 
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separated from the lac-10 consensus sequence by 17 
nucleotides within which was positioned a SstI site. 
The sequence also contained the 5 1 end of the lac mRNA, 
the lac repressor binding site and a Bsinl overhang. 

TacA+ 5 ' AATTACTCCCCATCCCCCTGTTGACAATTAATCATCGAGCTC 

GTATAATGTGTGGAATTGTGAGCGGATAACAATTTCACACAG 3 1 



TacA- 5 1 GTGTGAAATTGTTATCCGCTCACAATTCCACACATTATA 

CGAGCTCGATGATTAATTGTCAACAGGGGGATGGGGAGT 3 ' 



15 



20 



25 



4. Ligation and Isolation of pTNPt 

The 2,8kb EcoRI-BamHI fragment, the 150bp 
Bsml-BamHI fragment and oligonucleotides TacA+ and 
TacA- were ligated together using DNA ligase and the 
DNA was used to transform competent JM109 (laclq) . A 
correct construct was isolated by restriction analysis 
and DNA sequencing. 



(EcoRI site of pBMll) 

GAATTACTCCCCATCC 



Sst I 

trp-35 (17bp) lac-10 

CCCTG [TTGACA] ATTAATCATCGAGCTCG (TATAATG) 



BsmI 

30 5' lac mRNA-> 



35 



5' lac mRNA-> . n mRNA-> 

TGTGG/AATTGTGTGAGCGGATAACAATTTCACACAGCATTCAAAGCAGAAGGCT 

TTGGGGTGTGTGATACGAAACGAAGCATTGGCCGTAAGTGCGATTCCGGATTAGC 

TGCCAATGTGCCAATCGCGGGGGGTTTTCGTTCAGGACTACAACTGCCACACACC 
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Pvul 

mSD (8bp) Signal Sequence -> 
ACCAAAGCTAACTGAC {AGGA} GAATCCAG ATGAAACAATCTACGATCGCCC 

MKQSTIAL 



5 Smal 

Hindi I I BamH I 
TCGCACTTCTCCCACTGCTGTTCACTCCAGTGACAAAAGCTTCCCGGGATCCGTG 
A LLPLLFTPVTK 

(BamHI site of pBMll) 
Trans. Term. I 
1 0 ACTAATTGGGGACCCTAGAGGTCCCCTTTTTTATTTTAAAACGATCC 



Example III. 
Preparation of Genes of Interest 

15 

A. Synthetic Growth Factor Genes 

Synthetic growth factor genes were designed 
which use host cell codons optimized for high levels of 
expression. In addition, several convenient restriction 

20 sites were designed into the synthetic genes. When 

possible, the new restriction sites left the amino acid 
sequence of the growth factor gene unaltered, however, 
in some cases incorporation of the new restriction site 
yielded an altered* amino acid sequence. These sites 

25 roughly divide the synthetic genes into thirds yielding 
N-terminal, middle and C-terminal domains. 

The natural VGP gene product contains an 
extreme N-terminal domain which has no counterpart in 
mature TGF. VGF fragments lacking this domain are re- 

^ ferred to as truncated. The restriction sites were 

used for initial construction of the final genes from 
partial synthetic oligonucleotide fragments extending 
from one restriction site to another. The oligonucleo- 
tides were synthesized on an Applied Biosystems oligo- 

^ nucleotide synthesizer and were purified on an acryl- 

amide gel. The oligonucleotides were phosphorylated at 
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the 5 1 end using T4 polynucleotide kinase and each 
oligonucleotide was then annealed to its complement. 

1. TGF Synthetic Oligonucleotides 

a. Human TGF N-terminal domain 



TGF -> 
BssHIINcol 

MVVSHFNDCPDSHTQF 
10 5 ' CGCGCCATGGTTGTTTCTCACTTTAACGACTGCCCGGACTCTCATACTCAGT 
3 ' GGTACCAACAAAGAGTGAAATTGCTGACGGGCCTGAGAGTATGAGTCA 



Kpn l 
C - F H 6 T 
TTTGCTTTCATGGTAC 3' TGF104 
15 AAACGAAAGTAC 5' TGF103 



b. Modified human TGF middle domain with the 
human sequence QEDK being altered to 
QEEK, the sequence found In rat TGF 



Kpn l Sph I 
CRFLVQEEKPAC 
5' CTGCCGTTTTCTGGTTCAGGAAGAAAAACCGGCATG 3' TGF101 

25 3* CATGGACGGCAAAAGACCAAGTCCTTCTTTTTGGCC 5 1 TGF102 



c. Human TGF Oterminal domain: 



30 



VCHSGYVGARCEHADL 
5 ' CGTTTGCCATTCTGGCTACGTTGGCGCACGTTGCGAACACGCTGACCT 
3 1 GTACGCAAACGGTAAGACCGATGCAACCGCGTGCAACGCTTGTGCGACTGGA 



BamH I 

L A Ter 
GCTGGCTTAAG 3' TGF205 

CGACCGAATTCCTAG 5 1 TGF206 
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10 



2. VGF Synthetic Oligonucleotides 

a. VGF extreme N-terminal domain 



Hindi I I 

~E~ DSGNAIETTSPEITNA 
5 ' AGCTGACTCTGGTAACGCTATCGAAACTACTTCTCCGGAAATCACTAACGCT 
3 » CTGAGACCATTGCGATAGCTTTGATGAAGAGGCCTTTAGTGATTGCGA 



T T 

ACTACT 3' VGF105 
TGATGA 5' VGF106 



15 



b. Modified VGF N-terminal domain including 
Asp-Pro cleavage site, with the sequence 
HGT replacing the natural sequence HGD 



20 



25 



Bam HI 

~T" DPMDIPAIRLCGPEGD 
5 • GATCGATCCCATGGACATCCCGGCTATCCGTCTGTGCGGCCCGGAAGGCGAC 
3 • CTAGGGTACCTGTAGGGCCGATAGGCAGACACGCCGGGCCTTCCGCTG 

Kpn l 

G Y C L H G T 
GGCTACTGCCTGCATGGTAC 3' VGF104a 
CCGATGACGGACGTAC 5 ' VGF103a 



30 



c. Modified VGF middle domain having the 
sequence GYAC replacing the natural 
sequence GMYC 



Kpn l Sph I 
TCIHARDIDGYAC 
35 5' CTGCATCCATGCACGTGACATCGACGGCTACGCATG 3' VGFlOla 

3' CATGGACGTAGGTACGTGCACTGTAGCTGCCGATGC 5' VGF102a 
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d. VGFC-terminal domain , 5' end 



Sph I EcoR I 
CRCSHGYTG 
5 1 CCGTTGCTCTCATGGCTACACTGG 3 1 VGF1A 

3' GTACGGCAACGAGAGTACCGATGTGACCTTAA 5 1 VGF2A 



10 e. Modified VGF C-terminal domain, 5' end, 

with the sequence VCS replacing the 
natural sequence RCS 



15 S2hl EcoRI 

VCSHGYTG 
5 . CGTTTGCTCTCATGGCTACACTGG 3' VGF1 

3' GTACGCAAACGAGAGTACCGATGTGACCTTAA 5' VGF 2 



20 f, VGF C-terminal domain, 3' fragment, 

ending at YQR instead of PNT, the deduced 
C-terminus of natural secreted VGF 



25 EcoRI 

IRCQHVVLVDY QR Ter 
5 1 AATTCGTTGCCAGCATGTTGTTCTGGTCGACTACCAGCGTTAAG 
3 1 GCAACGGTCGTACAACAAGACCAGCTGATGGTCGCAATTC 



30 



BamH I 

GATC 3' VGF3 
5 ' VGF 4 



3. EGF Synthetic Oligonucleotides 

Three sets of overlapping synthetic oligo- 
35 nucleotides 1(A,B), 2(A,B) and 3(A,B) coding for human 
EGF were synthesized on an Applied Biosystems oligo- 
nucleotide synthesizer and purified on an acrylamide 
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gel. The oligonucleotides were phosphorylated at the 
5' end using T4 polynucleotide kinase. Each oligo- 
nucleotide was annealed to its complement. 



NcoIEcoRI 

MNSDSECPLSHDG 
5 ' CATGAATTCTGACTCTGAATGCCCGCTGTCTCATGACGGC 
3 ' TTAAGACTGAGACTTACGGGCGACAGAGTACTGCCG 

Y 

10 TAC 3' EGP1A 

ATGACGGAC 5' EGF2A 



Nsi l 

• CLHDGVCMYI E A L D K Y A 
15 5 ' TGCCTGCATGACGGCGTATGCATGTACATCGAAGCTCTGGACAAGTACG 
3 ' GTACTGCCGCATACGTACATGTAGCTTCGAGACCTGTTCATGC 

Sph I 
C 

CATG 3' EGF1B 
5 ' EGF2B 

20 



Sph I 

NCVVGYIGERCQYRD 
5 ' CAACTGCGTTGTTGGCTACATCGGCGAACGTTGCCAGTACCGTGAC 
3 ' GTACGTTGACGCAACAACCGATGTAGCCGCTTGCAACGGTCATGGCACTG 

BamH I 

LKWWELR* 

CTGAAATGGTGGGAACTGCGTTAAG 3 1 EGF3 

GACTTTACCACCCTTGACGCAATTCCTAG 5' EGF4 



4 . Assembly of Growth Factor Genes 

a. Preparation of Plasmid pLEBam/TTV 
The synthetic chimeric growth factor, denoted 
TTV or (TGF/TGF/VGF) was assembled in the cloning vec- 
tor pLEBam. This hybrid growth factor contained the 
amino acid sequence of human TGF in the amino terminal 
two-thirds of the gene with the exception of the se- 
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quence QEEK which was altered from the natural human 
sequence QEDK. The carboxy terminus was derived from 
the amino acid sequence of VGF and terminated with the 
sequence YQR TMR upstream of the natural sequence 
5 PNT. Plasmid pLEBam was digested with BssH II and 
BamHI. BssHII-BamHI pLEBam was then ligated to 
oligonucleotides TGF101, 102, 103, and 104, and VGF1, 
2, 3, and 4 using DNA ligase and the resulting plasmids 
used to transform competent HB101. The transf ormants 
10 were selected on ampicillin and screened by restriction 
analysis using EcoR I , Nco l and BamH I and by nucleotide 
sequencing using the Maxam-Gilbert protocol. A correct 
construction was isolated and denoted pLEBam/TTV. 



-15 



TGF + 
Nco l 

CCATGGTTGTTTCTCACTTTAACGACTGCCCGGACTCTCATACTCAGTTTTGCTT 
HVVSHFNDCPDSHTQFCF 



20 VGF * 

Kpn l Sph I 
TCATGGTACCTGCCGTTTTCTGGTTCAGGAAGAAAAACCGGCATGCGTTTGCTCT 
HGTCRFLVQEEKPACVCS 



25 



EcoR I 

CATGGCTACACTGGAATTCGTTGCCAGCATGTTGTTCTGGTCGACTACCAGCGT 
HGYTGIRCQHVVLVDYQR 



BamH I 
TAAGGATCC 
Ter 

30 

b. Preparation of Plasmid pLEBam/TW 
The synthetic chimeric growth factor, denoted 
TW or ( TGF/VGF/VGF ) was assembled in the cloning vec- 
35 tor pLEBam. This hybrid growth factor contained the 

amino acid sequence of human TGF in the N-terminal do- 
main of the gene. The middle and C-terminal domains 
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are derived from the truncated VGF sequence and end 
with the sequence YQR upstream of the natural sequence 
PNT. In addition, the synthetic gene has the modifica- 
tion, GYACVC for GMYCRC. 
5 (1) Preparation of a 4.3kb KpnI-SphI 

? fragment of pLEBam/TTV 

Plasmid pLEBam/TTV was digested with Kpn l and 
a Sph I and -the 4.3kb KpnI-SphI fragment was gel 

purified. This digestion removes the middle TGF domain 
10 from the synthetic gene TTV in the cloning plasmid 
pLEBam . 

(2) Ligation and isolation of pLEBam/TW 
Oligonucleotides VGFlOla and 102a were ligated 
to the 4.3kb KpnI-SphI fragment of pLEBam/TTV using DNA 
IS ligase and the resulting mixture was used to transform 
competent HB101. The transformants were selected on 
ampicillin and were screened by nucleotide sequencing 
using the Sanger-dideoxy method. A correct construct 
was isolated and denoted pLEBam/TW. 



25 



TGF - 

Ncol 

CCATGGTTGTTTCTCACTTTAACGACTGCCCGGACTCTCATACTCAGTTTTGCTT 
MVVSHFNDCPDSHTQFCF 



VGF ♦ 

Kpn l SphI 
TCATGGTACCTGCATCCATGCACGTGACATCGACGGCTACGCATGCGTTTGCTCT 

HGTCIHARDIDGYACVCS 



30 EcoR I 

CATGGCTACACTGGAATTCGTTGCCAGCATGTTGTTCTGCTCGACTACCAGCGT 

HGYTGIRCQHVVLVDYQR 



35 



BamH I 
TAAGGATCC 
Ter 
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B. Synthetic Platelet Factor 4 Gene 

A synthetic platelet factor 4 gene was de- 
signed which uses bacterial codons optimized for high 
levels of expression. Single-stranded overlapping 
5 sequences were prepared, combined in an annealing 

medium and ligated to provide the complete gene with 
appropriate termini for insertion into an expression 
vector in reading phase to prepare a fused protein from 
which platelet factor 4 could be isolated. The 
10 resulting expression vector was called pHCPF4. The 

single-stranded segments were 5 1 -phosphorylated with T4 
polynucleotide ligase and annealed by combining 200 pM 
of each segment in a 30 yl reaction volume (30 mM ATP, 
10 mM DTT, 10 mM MgCl 2 1 ug/ml spermidine, 100 mM Tris- 
15 HC1,. pH 7.8 and T4 DNA ligase. The dsDNA was digested 
with BssH II and BamH I and purified on a 7% native 
polyacrylamide gel. 

The following sequence was prepared: 



3 1 GGTACCTTCGACTTCTTCTGCCTCTAGACGTCACG 
5 1 CGCGCCATGGAAGCTGAAGAAGACGGAGATCTGCAGTGC 
MEAEED GDLQC 

NH 2 



3 ' GACACGCATTTTTGATGAAGAGTCCATTCCGGAGCAGTG 
5 ' CTGTGCGTAAAAACTACTTCTCAGGTAAGGCCTCGTCAC 
LCVKTTSQVRPRH 



3 ' TAGTGTAGTGAGCTCCATTAGTTTCGGCCGGGCGTCACGGGC 
5 • GGCATCACATCACTCGAGGTAATCAAAGCCGGCCCGCACCCG 
ITSLEVOKAGPHCP 



3 r TGACGAGTCGACTAGCGCTGAGACTTTTTGCCAGCATTC 
5 1 ACTGCTCAGCTGATCGCGACTCTGAAAAACGGTCGTAAG 
TAQL IA TLKNGRK 



3 ' TAGACAGATCTGGACGTCCGAGGCGACATGTTTTTTTAG 
5 1 ATCTGTCTAGACCTGCAGGCTCCGCTGTACAAAAAAATC 
I CLDLQAPLYLLI 
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3 ' TAGTTTTTTGACGACCTTAGAATTCCTAG 
5 1 ATCAAAAAACTGCTGGAATCTTAAG 
I K K L L E S *** 

I 

COOH 



C. DNA Cloning of Oncostatin M 

1. Preparation of cDNA Libraries 

Poly(A) + *RNA obtained from D937 cells treated 
with media containing phorbol 12-myristate 13-acetate 
PMA (10 ng/ml) for 16, 36 , and 52 hours was pooled and 
used for cDNA synthesis and cloning into a xgt 10 
vector, essentially as described by Huynh et al. , DNA 
Cloning Techniques: A Practical Approach , D. Glover 
15 (ed) (1984). Briefly, 10 yg poly(A + ) RNA was reverse 
transcribed in the presence of 50 pmol oligo dT. The 
second strand was synthesized using DNA polymerase I 
and the cDNA was treated with SI nuclease to eliminate 
the hair pin loop. The cDNA was then dG tailed by 
treatment with terminal deoxy nucleotidyl transferase. 
The dG tailed cDNA was subsequently chromatographed on 
Biogel A-50 column to eliminate cDNA smaller than 300 
bp. The sized dG tailed cDNA were ligated into EcoRI 
cut xgt 10 in the presence of single stranded 16 nuc- 
leotide-long linker molecule comprising, from the 5' 
end, AATT followed by 12 deoxycytosine residues (Webb 
et al. , 1987). The ligated DNA was packaged in vitro 
(Grosveld et al . , Gene (1981) 13:227-237 ) and the 
phage was used to infect E. coli C60 Hfl + . This tech- 
* ^ nigue gave 3x10^ recombinants/yg cDNA. Nitrocellulose 

filter plaque lifts were done in duplicate and the 
*> filters were probed using long, best guess 35 to 50 

nucleotide long probes. The oligonucleotide probes 
were derived from the peptide sequences obtained by 
automated repetitive Edman degradation. The purified 
Oncostatin M sequence was either derived from the N- 
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terminal of the protein or by sequencing the protease- 
generated lysine peptides. 

Initial screening of the Xgt 10 library was 
.done using a 50 mer oligonucleotide probe. The probe 
5 was derived from the lysine peptide. 



Peptide 1 : 

(K)AQDLERSGL.NIE 
10 3' TTC CGG GTC CTG GAC CTC GCC AGA CCG GAC TTG TAA CTC 

D L E K 
CTG GAC CTC TT 5 ' 



15 Xgt 10 clones showing positive reactivity to 

labeled above oligonucleotide were plague purified. 
Eight clones were obtained. Southern blot analysis 
showed that the positively reacting cDNA inserts in the 
clones ranged between 600 bp to 2 Kb. Subsequently, 

20 Southern blots were done using a 35 mer oligonucleotide 
(encoding amino acids 53-64) and a 41 mer oligonucleo- 
tide (encoding amino acids 22-35). Only one clone 
showed positive reactivity with all three radiolabeled 
oligonucleotide probes. 

25 The cDNA insert of the xgt 10 clone (X0M) was 

found to be approximately 2.1 Kb. The cDNA insert 
flanked by. EcoRI sites at 5 1 and 3 ' ends was subcloned 
in the EcoR I site of the polylinker region of the 
plasmid vector pEMBL18 (Dente et al . , Nucleic Acid Res. 

30 (1983) 11:1645-1655. The recombinant was termed 

pOncM46. Subsequently, additional cDNA clones were 
obtained by specific priming using oligonucleotides 
derived from the 5 1 coding region of the Oncostatin M 
gene and a genomic clone containing the entire gene was 

35 isolated. 



WO 89/03886 



55. 



PCT/US88/03872 



2. Restriction Site Mapping 

A restriction map of the clone pOncM46 coding 
Oncostatin M protein was obtained by standard single or 
double digestions of the plasmid DNA. The coding 
5 region has four Pst I sites, a Bqlll site and a Sma l 
site. 

3. DNA Sequence of Oncostatin M 

The entire nucleotide sequence of the cDNA 
clones was obtained and a consensus sequence was 
0 determined as follows: 



CGGGCCGGAGCACGGGCACCCAGCATGGGGGTACTGCTCACACAGAGGAC 

MGVLLTQRT 

5 GCTGCTCAGTCTGGTCCTTGCACTCCTGTTTCCAAGCATGGCGAGCATGG 
KKSKVKA KKFOSNASMA 

CGGCTATAGGCAGCTGCTCGAAAGAGTACCGCGTGCTCCTTGGCCAGCTC 
AIGSCSKEYRVLLGQL 

CAGAAGCAGACAGATCTCATGCAGGACACCAGCAGACTGCTGGACCCCTA 
Q QKQTDLMQDTSRLLDPY 

TATACGTATCCAAGGCCTGGATGTTCCTAAACTGAGAGAGCACTGCAGGG 
IRIQGLDVPKLREHCRE 

AGCGCCCCGGGGCCTTCCCCAGTGAGGAGACCCTGAGGGGGCTGGGCAGG 
RPGAFPSEETLRGLGR 

5 CGGGGCTTCCTGCAGACCCTCAATGCCACACTGGGCTGCGTCCTGCACAG 
RGPLQTLNATLGCVLHR 

ACTGGCCGACTTAGAGCAGCGCCTCCCCAAGGCCCAGGATTTGGAGAGGT 
LAD LEQRLPKAQDLERS 

CTGGGCTGAACATCGAGGACTTGGAGAAGCTGCAGATGGCGAGGCCGAAC 
Q GLNIEDLEKLQMARPN 

ATCCTCGGGCTCAGGAACAACATCTACTGCATGGCCCAGCTGCTGGACAA 
ILGLRNNIYCMAQLLDN 

CTCAGACACGGCTGAGCCCACGAAGGCTGGCCGGGGGGCCTCTCAGCCGC 
SDTAEPTKAGRGASQPP 

5 CCACCCCCACCCCTGCCTCGGATGCTTTTCAGCGCAAGCTGGAGGGCTGC 

----J? ..._P TPASDAFQRKLEGC 
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AGGTTCCTGCATGGCTACCATCGCTTCATGCACTCAGTGGGGCGGGTCTT 
RFLHGYHRFMHSVGRVF 

CAGCAAGTGGGGGGAGAGCCCGAACCGGAGCCGGAGACACAGCCCCCACC 
S KWGESPNRSRRHSPHQ 

AGGCCCTGAGGAAGGGGGTGCGCAGGACCAGACCCTCCAGGAAAGGCAAG 
ALRKGVRRTRPSRKGK 

AGACTCATGACCAGGGGACAGCTGCCCCGGTAGCCTCGAGAGCACCCCTT 
RLMTRGQLPR 

GCCGGTGAAGGATGCGGCAGGTGCTCTGTGGATGAGAGGA 

ACCATCGCAGGATGACAGCTCCCGGGTCCCCAAACCTGTTCCCCTCTGCT 
ACTAGCCACTGAGAAGTGCACTTTAAGAGGTGGGAGCTGGGCAGACCCCT 
CTACCTCCTCCAGGCTGGGAGACAGAGTCAGGCTGTTGCGCTCCCACCTC 
AGCCCCAAGTTCCCCAGGCCCAGTGGGGTGGCCGGGCGGGCCACGCGGGA 
CCGACTTTCCATTGATTCAGGGGTCTGATGACACAGGCTGACTCATGGCC 
GGGCTGACTGCCCCCCTGCCTTGCTCCCCGAGGCCTGCCGGTCCTTCCCT 
CTCATTGACTTGCAGGGCCGTTGCCCCCAGACTTCCTCCTTTCCGTGTTT 
CTGAAGGGGAGGTCACAGCCTGAGCTGGCCTCCTATGCCTCATCATGTCC 
CAAACCAGACACCTGGATGTCTGGGTGACCTCACTTTAGGCAGCTGTAAC 
AGCGGCAGGGTGTCCCAGGAGCCCTGATCCGGGGGTCCAGGGAATGGAGC 
TCAGGTCCCAGGCCAGCCCCGAAGTCGCCACGTGGCCTGGGGCAGGTCAC 
TTTACCTCTGTGGACCTGTTTTCTCTTTGTGAAGCTAGGGAGTTAGAGGC 
TGTACAAGGCCCCCACTGCCTGTCGGTTGCTTGGATTCCCTGACGTAAGG 
TGGATATTAAAAATCTGTAAATCAGGACAGGTGGTGCAAATGGCGCTGGG 
AGGTGTACACGGAGGTCTCTGTAAAAGCAGACCCACCTCCCAGCGCCGGG 
AAGCCCGTCTTGGGTCCTCGCTGCTGGCTGCTCCCCCTGGTGGTGGATCC 
TGGAATTTTCTCACGCAGGAGCCATTGCTCTCCTAGAGGGGGTCTCAGAA 
ACTGCGAGCCCAGTTCCTTGGAGGGACATGACTAATTTATCGATTTTTAT 
CAATTTTTATCAGTTTTATATTTATAAGCCTTATTTATGATGTATATTTA 
ATGTTAATATTGTGCAAACTTATATTTAAAACTTGCCTGGTTTCTAAA 



Jhe consensus sequence was further verified by * 
comparison with the sequence of the genomic clone. The 
open reading frame continues from nucleotide 1 to the 
stop codon at nucleotide 783. The open reading frame 
codes for 8 amino acids upstream from the putative ini- 
tiating methionine. The nucleotide sequence coding for 
the putative initiating methionine agrees with the 
consensus sequence for the initiating methionine 
(Kozak, Cell (1986) 44:283-292). 
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The amino acid sequence of the Oncostatin M 
polypeptide deduced from the consensus cDNA sequence 
shows that Oncostatin M is derived from a 253 amino 
acid precursor polypeptide. The amino terminal 
5 sequence of purified Oncostatin M (see above and and 
Zarling, et al. , Proc. Natl. Acad. Sci. USA (1986) 
83:9739-9743) occurs at amino acid 26. It is preceded 
by a hydrophobic region that appears to function as a 
signal sequence. The mature protein has 228 amino 
10 acids with a molecular weight of 26,000 which is in 
close agreement with the approximate M r = 28/000 as 
determined by the polyacrylamide gel electrophoresis of 
the purified Oncostatin M (Zarling, et al. , 1986 
supra ) . 

15 Earlier protein chemistry work (Zarling et 

al . , 1986 supra ) showed that Oncostatin M is a glyco- 
protein. The cDNA clone sequence suggests two poten- 
tial N-glycosylation sites (Hubbard and Ivatt, Ann. 
Rev. Biochem. (1981) 5£:555-583) located at amino acids 

20 76 and 193 of the mature protein. The nucleotide- 

derived protein sequence shows that Oncostatin M is an 
extremely hydrophilic molecule. Twenty four base pairs 
of the 5' untranslated region and 1054 base pairs of 3' 
untranslated regions were. obtained in the different 

25 cDNA clones. However, a polyA tail and polyadenylation 
recognition site were not obtained. 
4. Preparation of pOncMW2 

The 2.1 kb Oncostatin M cDNA was excised from 
the x phage recombinant, pOncM46, by EcoR I digestion. 

30 The insert was cloned into pEMBL18 (Dente et al . , Nucl. 
Acid Res . (1983) 11:1645-1655) at the EcoRI site to 
produce clone pOncM46-15 vector with the Oncostatin M 
coding sequence opposed to the 6-gal sequence. The 5' 
noncoding sequence of Oncostatin M was removed by Sai l 

35 and Bgl ll double digestion of pOncM46-15 and replaced 

by a synthetic 80 bp Sal I- 3gl II fragment tc provide new 
SamKI and Ncol sites. The resulting clone was termed 
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pOncMEVS. The sequence of the 80 bp fragment is as 
follows: 

5' - TCGACGGATCCACCATGGCGGCGATCGGCAGCTGCTCG 
3 l - GCCTAGGTGGTACCGCCGCTAGCCGTCGACGAGC 

AAAGAGTACCGCGTGCTCCTTGGCCAGCTCCAGAAGCAGACA - 3' 
TTTCTCATGGCGCACGAGGAACCGGTCGAGGTCTTCGTCTGTCTAG - 5 

The coding sequence of Oncostatin M was 
excised from pOncMEVS by Xhol and Sai l double digestion 
as a 0.7 kb fragment and cloned into the pUC8 vector as 
the Sai l site. The clone pOncMW2, containing the cDNA 
insert with the coding sequence opposed to the lacZ' 
sequence, was isolated. The 0.7 kb NcoI-BamHI and the 
0.7 kb BamHI -BamHI fragment excised from clone pONcMW2 
were used to construct a xpL-based (pBM16/NDP/OncM) 
expression vector. 

Example IV. 

Expression of the Polypeptide of Interest 
as a Fusion Protein with the N-Protein 

A. Modified Synthetic TGF 

1 # Preparation of pBMll/N/TGF 

The modified human TGF was expressed in this 
system as part of a fusion with the 33 N-terminal amino 
acids of the N-gene and has the sequence QEEK replacing 
the human sequence QEDK. 

a. Preparation of a 780 bp Sphl-Pvul 
fragment of pBMll/N/TTV 
Plasmid pBMll/N/TTV was digested with Sph I and 
Pvul and the 780 bp Sph l- Pvu l fragment was gel puri- 
fied. This fragment contains part of the pBMll plasmid 
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at the Pvul end and at the SphI end, the N-gene and N- 
terminal two-thirds of the human TGP gene. 

b. Preparation of the 5 kb BamHI- PvuI frag- 
ment of pBMll/N/TTV 

Plasmid pBMll/N/TTV was digested with BamHI 
and Pvu l and the 5 kb BamHI-PvuI fragment was gel 
purified. 

c. Ligation and isolation of pBMll/N/TGF: 
Oligonucleotides TGF 205 and 206, the 780 bp 

Sphl-Pyul fragment and the 5 kb BamHI-PvuI fragment of 
pBMll/N/TTV were ligated together and used to transform 
competent HB101. The transformants were selected on 
neomycin and were screened by restriction analysis 
using EcoRI and nucleotide sequencing following the 
Sanger-dideoxy method. A correct construction was 
isolated and denoted pBMll/N/TGP. 



ATGGATGCACAAACACGCCGCCGCGAACGTCGCGCAGAGAAACAGGCTCAATGGA 
MDAQTRRRERRAEKQAQWK 

BamH I 

AAGCAGCAAATCCCCTGTTGGTTGGGGTAAGCGCAAAACCAGTTCGGATCCGCAT 
AANPLLVGVSAKPVRI RM 



GGTTGTTTCTCACTTTAACGACTGCCCGGACTCTCATACTCAGTTTTGCTTTCAT 
VVSHFNDCPDSHTQPCFH 



ggtIcctgccgttttctggttcaggaagaaaaaccggcatgcgtttgccattctg 
gtcrflvqeekpacvchsg 



BamH I 

GCTAGGT rp GGCGCACGTTGCGAACACGCTGACCTGCTGGCTTAAGGATCC 
YVGARCEHADLLA Ter 
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B. Modified Synthetic TGF-VGF Hybrid 

1. Preparation of pBMll/N/TTV 
In this construct, a synthetic modified TTV 
chimeric gene was expressed as the C-terminal portion 
5 of a fusion protein having the first 33 amino acids of 

the N-gene at the N-terminus. This hybrid growth fac- * 
tor contained the amino acid sequence of human TGF in 
the amino terminal two-thirds of the gene with the f 
exception of the sequence QEEK which was altered from 
10 the natural human sequence QEDK. The. carboxy terminus 
was derived from the amino acid sequence of VGF and 
terminated with the sequence YQR upstream of the na- 
tural sequence PNT. 

a. Preparation of Ncol (blunt) -BamHI TTV syn- 
15 thetic gene 

Plasmid pLEBam/TTV was digested with Ncol and 
the ends were made blunt by filling in the overhangs 
using the Klenow fragment of DNA polymerase. The DNA 
was then digested with BaroH I and the 170 bp Nco l (blunt)- 
20 * BamH I TTV fragment was gel purified. 

b. Preparation of BamHI digested pBMll 
Plasmid pBMll was digested with BamH I . 

c. Ligation and isolation of pBMll/N/TTV 
BamH I digested pBMll, the Ncol (blunt) -BamHI 

25 TTV fragment, and BamH I linkers (5'GATCCG3') were 

ligated together using DNA ligase and the resulting 
mixture was used *to transform competent HB101. The 
transformants were selected on neomycin and were 
screened using restriction analysis and nucleotide 

30 sequencing using the Sanger-dideoxy method. A correct ^ 
construct was isolated and denoted pBMll/N/TTV. 



35 



N-gene 

ATGGATGCACAAACACGCCGCCGCGAACGTCGCGCAGAGAAACAGGCTCAATGGA 
MDAQTRRRERRAEKQAQWK 
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BamHI 

AAfiCAGCAAATCCCCTGTTGGTTGGGGTAAGCGCAAAACCAGTTCGGATCCGCAT 
AANPLLV GVSAKPVRIRM 



TGP -* 

GGTTGTTTCTCACTTTAACGACTGCCCGGACTCTCATACTCAGTTTTGCTTTCAT 
VVSHPNDCPDSHTQFCFH 

Kpnl Sph I VGF * 

GGTACCTGCCGTTTTCTGGTTCAGGAAGAAAAACCGGCATGCGTTTGCTCTCATG 
GTCRFLVQEE KPACVCSHG 



EcoRI 

GCTACACTGGAATTCGTTGCCAGCATGTTGTTCTGGTCGACTACCAGCGTTAAG 
YTG I RCQHVVLVDYQR Ter 



BamHI 
15 GATCC 



C. Synthetic Platelet Factor 4 

1. Preparation of PBM11/N/PF4 (N-gene/Plateiet 
20 Factor 4) 

This plasmid was prepared as described above 
for pBMll/N/TTV, except that the synthetic PF4 gene was 
used in place of the chimeric TTV gene. The nucleotide 
sequence and corresponding amino acid sequence of the 
25 synthetic platelet factor 4 gene in fusion downstream 
of the nucleotide sequences coding for the first 33 
amino acids of the bacteriophage X N-gene in the 
•■ expression vector pBHll is as follows. 

30 

N— gene -> 

MDAQTRRRERRAEK 
ATG GAT GCA CAA ACA CGC CGC CGC GAA CGT CGC GCA GAG AAA 



35 



Q A. —JO W _JC--"A-^A- «--._P__ __L_^.L V.__ G V .. 

CAG GCT CAA "$ GT AAA GCA GCA AAT CCC CTG TTG GTT GGG - GTA 
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PF4 -> 

SAKPVRIRMEAEED 
AGC GCA AAA CCA GTT CGG ATC CGC ATG GAA GCT GAA GAG GAT 



GDLQCLCVKTTSQV 
GGA GAT CTG CAA TGC CTG TGC GTT AAG ACT ACG TCT CAG GTT 

RPRHITSLEV IKAG 
AGA CCG CGG CAT ATC ACT AGC CTC GAG GTT ATC AAA GCG GGC 



10 



PH- CPTAQLIATLKN 
CCA CAC TGT CCG ACT GCG CAG CTG ATC GCG ACT CTG AAA AAC 



GRKICLDLQAPLYK 
GGC CGT AAA ATA TGT CTG GAT CTG CAG GCA CCG CTG TAC AAG 



15 KIIKKLLES *** 
AAA ATC ATC AAA AAG CTT CTC GAG TCT TGA 



Example V, 

:he Polypepi 
Fusion Proi 
with the N-Protein and a Cleavage Site 



Preparation of the Polypeptide of Interest 
20 as a Fusion Protein 



A. Modified Synthetic VGF 

1. Preparation of pBMll/NDP/VGFA : 

25 The N- terminal sequence of the synthetic VGFA 

gene is a truncated version of the natural VGF sequence 
and begins with the sequence DIPAIR. In this plasmid 
the VGFA fragment is located downstream of 32 amino 
acids of the lambda N-protein and the dipeptide 

30 aspartic acid-proline In order to preserve the Kpn l 
cloning site, the synthetic sequence was altered to 
code for CLHCGTC instead of the natural VGF sequence 
CLHGDC and terminates with the sequence YQR upstream of 
^he_j^tai£aJ^segu^ 

35 codes for the sequence GYACVC which replaces the 
natural sequence GMYCRC. 
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a. Preparation of a KpnI-BamHI 80bp C-ter- 
minal fragment of the synthetic VGF gene 

Plasmid pLEBam/TWi was digested with Kpn l and 
BamHI and the 80bp KpnI-BamHI fragment was gel 
5 purified. This fragment contains the C-terminal two- 
thirds of the synthetic VGF gene with the Kp_nl site at 

the 5' end. 

b. Preparation of BamHI digested 
dephosphorylated pBMll 

10 Plasmid pBMll/N/TTV was digested with BamH I 

and the 5' phosphates were removed by treatment with 
calf intestinal alkaline phosphatase. The 5.6kp BamH I 
plasmid fragment was gel purified. 

c. Ligation and isolation of pBMll/NDP/VGFA 
15 Oligonucleotides VGF 103a, 104a, the 5.6kb 

BamHI fragment of pBMll and the 80bp KpnI-BamHI 
fragment of pLEBam/TW were ligated together using DNA 
ligase and then used to transform competent HB101. The 
transformants were selected on neomycin and were 

20 screened by restriction analysis using Cla l and 

nucleotide sequencing following the Sanger -dideoxy 
technique. A correct construction was isolated and 
denoted pBMll/NDP/VGFA. This construction has the 
sequences GTC and GYACVC instead of the authentic VGF 

25 sequences GDC and GMYCRC. 
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N-gene •» 

ATGGATGCACAAACACGCCGCCGCGAACGTCGCGCAGAGAAACAGGCTCAATGGA 
MDAQTRRRERRAEKQAQWK 



**** 
Cla l 

AAGCAGCAAATCCCCTGTTGGTTGGGGTAAGCGCAAAACCAGTTCGGATCGATC 
AANPLLVGVSAKPVRI DP 



Ncol VGF ♦ 

CCATGGACATCCCGGCTATCCGTCTGTGCGGCCCGGAAGGCGACGGCTACTGCCT 
MDIPAIRLCGPEGDGYCL 
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Kpn l SphI 
GCATGGTACCTGCATCCATGCACGTGACATCGACGGCTACGCATGCGTTTGCTCT 
HGTCI HARDIDGYACVCS 



EcoRI 

CATGGCTACACTGGAATTCGTTGCCAGCATGTTGTTCTGGTCGACTACCAGCGT 
HGYTGIRCQHVVLVDYQR 



BamH I 
TAAGGATCC 
Ter 



2. Preparation of pBMll/NDP/VGFa 

The N-terminal sequence of VGPa is a truncated 
version of the natural VGF sequence and starts with the 
sequence DIPAIR. In addition, the VGPa sequence con- 
tains the altered sequences GTC and GYACRC instead of 
the natural VGP sequences GDC and GMYCRC. In this 
plasmid the VGFa gene is located downstream of 32 amino 
acids of the lambda N-protein and the dipeptide 
aspartic acid-proline. Treatment of the purified 
fusion protein with formic acid results in cleavage at 
the acid labile aspartic acid-proline peptide bond 
allowing separation of the VGFa protein from the lambda 
N-protein amino- terminus. Cleavage is such that the 
VGPa protein is left with the proline residue at the 
amino terminus. * 

a. Preparation of SphI digested, 
dephosphorylated pBMll/DP/VGFA 
Plasmid pBMll/DP/VGFA (10 ug) was digested $ 
with 30 units of Sph I and the 5' phosphates were 

removed by treatment with calf intestinal alkaline s 
phosphatase. The 5kb plasmid fragment was recovered 
after electrophoresis on an agarose gel. 
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b. Preparation of an EcoRI-SphI 70bp 
fragment of pBMll/DP/VGFA 
Plasmid pBMll/DP/VGFA {10 pg) was digested 
with 30 units of EcoR I and then 30 units of Sphl. The 
5 70bp fragment was' recovered after electrophoresis on an 
agarose gel. 

3. Ligation and isolation of pBMll/NDP/VGFa 

The 24bp fragment containing oligonucleotides 
VGF 1A and 2A, the 5kb Sph l fragment and the 70bp 
0 EcoRI-SphI fragment of pBMll/DP/VGFA were ligated 
together and the mixture was used to transform 
""'competent E. coli HB101 cells. The transf ormants were 
screened by nucleotide sequencing using the Sanger- 
dideoxy nucleotide method. A correct clone was isolated 
5 and denoted pBMll/NDP/VGFa. 

N-gene ■* 

ATGGATGCACAAACACGCCGCCGCGAACGTCGCGCAGAGAAACAGGCTCAATGGA 
MDAQTRRRERRAEKQAQWK 



**** 

Clal 

AAGCAGCAAATCCCCTGTTGGTTGGGGTAAGCGCAAAACCAGTTCGGATCGATCC 
AANPL. LVGVSAKPVRIDP 



Ncol VGF - 

CCATGGACATCCCGGCTATCCGTCTGTGCGGCCCGGAAGGCGACGGCTACTGCCT 
MDIPAIRLCGPEGDGYCL 



Kpn l Sph l 
GCATGGTACCTGCATCCATGCACGTGACATCGACGGCTACGCATGCCGTTGCTCT 
HGTCIHARDIDGYACRCS 



EcoR I 

CATGGCTACACTGGAATTCGTTGCCAGCATGTTGTTCTGGTCGACTACCAGCGT 
HGYTG IRCQHVVLVDYQR 



Bam HI 

TAAGGATCC 
Ter 
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B. Modified Synthetic TGF-VGF Hybrids 
1. Preparation of pBMll/NDP/TTV 
5 In this construct, the synthetic modified TTV 

chimeric gene is expressed as the C-terminal portion of 
a fusion protein having the first 32 amino acids of the 
N-gene at the N-terminus. An acid labile aspartic 
acid-proline dipeptide separates the two parts of the 
10 fusion. The hybrid growth factor contains the amino 
acid sequence of human TGF in the amino terminal two- 
thirds of the gene with the exception of the sequence 
QEEK which was altered from the natural human TMR se- 
quence QEDK. The carboxy terminus was derived from the 
15 amino acid sequence of VGF and terminated with the se- 
quence YQR upstream of the natural sequence PNT. 

a. Preparation of 5kb Ncol pBMll plasmid 
fragment 

Plasmid pBMll/NDP/VGFA was digested with Nco l 
20 and the 5kb Nco l plasmid fragment was gel purified. 
This fragment has one Nco l overhang at the aspartic 
acid-proline cleavage site downstream of the sequences 
coding for the first 32 amino acids of the N-gene. The 
other Nco l site is in the neomycin resistance gene. 
25 b. Preparation of 0.6kb NcoI-BamHI pBMll 

plasmid fragment 
Plasmid pBMll/N/TTV was digested with Nco l and 
BamHI and the O.Skb Nco I- BamH I plasmid fragment was gel 
purified. This fragment has the Nco l overhang in the 
30 neomycin resistance gene. 

c. Preparation of the 170bp synthetic 
TGF/TGF/VGF fragment 
Plasmid pLEBam/TTV was digested with Nco l and 
BamH I and the Nco I- BamH I 170bp fragment containing the 
35 TGF/TGF/VGF synthetic gene was gel purified. This 
fragment has the Nco l overhang at the 5' end of the 
gene and the BamH I overhang at the 3 1 end of the gene. 
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d. Ligation and isolation of dBM11/NDP/TTV 
The 5kb Nco l and the 0.6kb NcoI-BamHI plasmid 
fragments were ligated with the 170bp NcoI-BamHI TTV 
gene using DNA ligase and the resulting mixture was 
5 used to transform competent HB101. The transf ormants 
were selected on neomycin such that only colonies with 
correctly reconstructed neomycin resistance genes would 
survive. Transf ormants were screened using restriction 
analysis with Ncol and nucleotide sequencing using the 
10 Sanger-dideoxy technique. A correct construction was 
isolated and denoted pBMll/NDP/TTV. 



ATGGATGCACAAACACGCCGCCGCGAACGTCGCGCAGAGAAACAGGCTCAATGGA 
15 MDAQTRRRE RRAEKQAQWK 

Cla l 

AAGCAGCAAATCCCCTGTTGGTTGGGGTAAGCGCAAAACCAGTTCGGATCGATC 
AANPLLVGVSAKP-VR I DP 

20 

Ncol TGP * 

CCATGGTTGTTTCTCACTTTAACGACTGCCCGGACTCTCATACTCAGTTTTGCTT 
MVVSHPNDCPDSHTQFCF 

Kpnl SphI VGF ♦ 

TCATGGTACCTGCCGTTTTCTGGTTCAGGAAGAAAAACCGGCATGCGTTTGCTCT 
25 HGTCRPLVQEEKPACVCS 



EcoR I 

CATGGCTACACTGGAATTCGTTGCCAGCATGTTGTTCTGGTCGACTACCAGCGT 
HGYTG I RCQHVVLV DY QR 



30 



BamH I 
TAAGGATCC 
Ter 



35 



2. Preparation of pBMll/NDP/VTV 

In this construct, the synthetic modified VTV 
chimeric gene was expressed as the C-terminal portion 
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of a fusion protein having the first 32 amino acids of 
the N-gene at the N-terminus. An acid labile aspartic 
acid-proline dipeptide separates the two parts of the 
fusion. The hybrid growth factor contained the amino 
5 acid sequence of human TGF in the middle domain with 
the amino acid sequence QEEK replacing the natural 
sequence QEDK. The N-terminal and C-terminal domains 
were derived from the truncated VGF sequence and begin 
with the sequence DIPAIR and end with the sequence YQR 
10 which is upstream of the natural sequence PNT. 

a. Preparation of a 5kb BamHI-NcoI fragment 
of pBMll 

Plasmid pBMll/N/TTV was digested with BamH I 
and Ncol and the 5kb BamHI-Ncol fragment was gel puri- 
15 fied. This fragment contains a BamH I overhang at the 
3' end of the sequences coding for the first 32 amino 
acids of the N-gene and a Nco l site in the neomycin 
resist ence gene. 

b. Preparation of a 700 bp KpnI-Ncol 
20 fragment of pBMll/N/TTV 

Plasmid pBMll/N/TTV was digested with Kpn l and 
Nco l and the 700 bp Kpnl -Ncol fragment was gel 
purified. This fragment is made up of part of plasmid 
pBMll containing part of the neomycin resistence gene 
25 at the Nco l overhang , and the C-terminal VGF domain of 
the TTV synthetic gene at the Kpn l overhang. 

c. Ligation and isolation of pBMll/NDP/VTV 
Oligonucleotides VGF 103a and '104a, the 5kb 

BamH I - Nco l fragment of pBMll and the 700 bp Kpn l- Nco l 
30 fragment of pBMll/N/TTV were ligated together using DNA 
ligase and then used to transform competent HB101. The 
transf ormants were selected on neomycin and were 
screened by restriction analysis using Clal and 
nucleotide sequencing following the Sanger-dideoxy 
35 technique. 
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N-gene + 

ATGGATGCACAAACACGCCGCCGCGAACGTCGCGCAGAGAAACAGGCTCAATGGA 
MDAQTRRRERR.AEKQ AQWK 



**** 
Clal 

AAGCAGCAAATCCCCTGTTGGTTGGGGTAAGCGCAAAACCAGTTCGGATCGATC 
AANPLLVGVSAKPVRI DP 



Ncol VGP - 

CCATGGACATCCCGGCTATCCGTCTGTGCGGCCCGGAAGGCGACGGCTACTGCCT 
MDIPAIRLCGPEGDGYCL 



Kpnl TGF * Sghl VGP * 

GCATGGTACCTGCCGTTTTCTGGTTCAGGAAGAAAAACCGGCATGCGTTTGCTCT 
HGTCRFLVQEEKPACVCS 



15 EcoRI 

CATGGCTACACTGGAATTCGTTCGCAGCATGTTGTTCTGGTCGACTACCAGCGT 
HGY TG IRCQHVVLVDYQR 



10 



20 



BamH I 
TAAGGATCC 
Ter 



3. Preparation of pBM16/NDP/TW 

In this construct , the synthetic modified TW 
chimeric gene was expressed as the C-terminal portion 
of a fusion protein having the first 32 amino acids of 
the N-gene at the N-terminus. An acid labile aspartic 
acid-proline dipeptide separates the two parts of the 
fusion. The hybrid growth factor contained the amino 
acid sequence of human TGF in the N-terminal domain. 
The middle and C-terminal domains were derived from the 
truncated VGF sequence and end with the sequence YQR. 
In addition, the synthetic gene has the modification 
GYACVC for GMYCRC. 



35 
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a. Preparation of 4.3kb NcoI-BqIII fragment 
of pBMll/NDP/VGFa 

Plasmid pBMll/NDP/VGFa was digested with Ncol 
and Bgl ll and the 4.3kb fragment was gel purified. The 
5 Nco l overhang is positioned at the aspartic acid- 

proline cleavage site just downstream of the first 32 
amino acids of the N-gene. 

b. Preparation of the 1.2kb BamHI-Bglll 
fragment of pBMllMS 

10 Plasmid pBMllMS was digested with BamHI and 

Bgl ll and the 1.2kb fragment was gel purified. This 
fragment differs from the normal pBMll fragment in that 
the Nco l site in the neomycin resistence gene has been 
removed, and all subsequent vectors lacking this Nco l 

15 site are referred to as pBM16. 

c. Preparation of the 170bp NcoI-BamHI TW 
synthetic gene 

Plasmid pLEBam/TW was digested with Nco l and 
"BamHI and the 170 bp NcoI-BamHI fragment was gel 
20 purified. This synthetic gene fragment has the Nco l 
site at the 5' -end and the BamH I site at the 3' -end. 

d. Ligation and isolation of pBM16/NDP/TW 
The 4.3kb Ncol -Bglll fragment of 

pBMll/NDP/VGFa and 1.2kb BamHI-Bglll fragment of 
25 pBMllMS, and the 170 bp Nco I- BamH I TW synthetic gene 

fragment were ligated together using DNA ligase and the 
resulting mixture was used to transform competent 
HB101. The transformants were selected on neomycin and 
screened by restriction analysis and nucleotide 
30 sequencing using the Sanger-dideoxy technique. The 
plasmid is denoted pBM16 to indicate the loss of the 
Nco l restriction site in the neomycin resistance gene. 

35 N-gene * 

ATGGATGCACAAACACGCCGCCGCGAACGTCGCGCAGAGAAACAGGCTCAATGGA 
MDAQTRRRERRAEKQAQWK* 
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**** 
Clal 

AAGCAGCAAATCCCCTGTTGGTTGGGGTAAGCGCAAAACCAGTTCGGATCGATC 
AANPLLVGV SAKPVR I DP 



Ncol TGF * 

CCATGGTTGTTTCTCACTTTAACGACTGCCCGGACTCTCATACTCAGTTTTGCTT 
MVVSHPNDCPDSHTQPCF 



Kpnl VGP -> Sph I 
TCATGGTACCTGCATCCATGCACGTGACATCGACGGCTACGCATGCGTTTGCTCT 
HGTCIHARDIDGYACVCS 



EcoRI 

CATGGCTACACTGGAATTCGTTGCCAGCATGTTGTTCTGGTCGACTACCAGCGT 
HGYTGIRCQHVVLVDYQR 



BamH I 
TAAGGATCC 
Ter 



C. Synthetic EG? 

1. Preparation of pBMll/NDP/EGF 

In this construct the human EGP gene is ex- 
pressed as part of a fusion with the 32 N-terminal 
amino acids of the N-gene which is downstream of an 
Asp-Pro cleavage site. 

a. Preparation of a 5kb Ncol fragment of 
pBMll 

Plasmid pBMll/DP/VGPA was digested with Nco l 
and the 5' phosphates were removed by treatment with 
calf alkaline intestinal phosphatase. The 5kb plasmid 
fragment was gel purified. This fragment has one Nco l 
overhang at the Asp-Pro cleavage site downstream of the 
sequences coding for the first 32 amino acids of the 
N-gene. The other Nco l site is in the Neomycin resis- 
tance gene. 
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b. Preparation of a O.Skb NcoI-BamHI 
fragment of pBMll 

Plasmid pBMll/N/TTV was digested with Ncol and 
BamH I and the 0.6kb Nco I- BamH I plasmid fragment was gel 
5 purified. This fragment has the Nco l overhang in the 
Neomycin resistance gene. 

c. Ligation and Isolation of pBMll/DP/EGF 
The three sets of annealed EGF 

oligonucleotides with an Nco l overhang at the 5 ' end 
10 and a BamH I overhang at the 3' end, the 5kb Nco l 

fragment of pBMll and the 0.6kb Nco I- BamH I fragment of 
pBMll were ligated together using T4 ONA Ligase and the 
resulting mixture was used to transform competent E. 
coli HB101. The transf ormants were selected on 
15 Neomycin such that only colonies with a correctly 

reconstructed Neomycin resistance gene would survive. 
The transf ormants were screened by restriction analysis 
using EcoR I and BamH I and by DNA sequencing, as 
described above. 

20 



N-gene * 

ATGGATGCACAAACACGCCGCCGCGAACGTCGCGCAGAGAAACAGCGTCAATGGA 
MDAQTRRRERRAEKQAQWK 



25 



35 



*** 

AAGCAGCAAATCCCCTGTTGGTTGGGGTAAGCGCAAAACCAGTTCGGATCGATCC 
AANPLLVGVSAKPVRIDP 



EGP1 * EcoR I EGF 2 * 

CATGAATTCTGACTCTGAATGCCCGCTGTCTCATGACGGCTACTGCCTGCATGAC 
30 MNSDSECPLSHDGYCLHD 



EGF 3 - 

Nsi l Sph I 
GGCGTATGCATGTACATCGAAGCTCTGGAGAAGTACGCATGCAACTGCGTTGTTG 
GVCMY IEALDKYACNCVVG 



GCTACATCGGCGAACGTTGCCAGTACCGTGACCTGAAATGGTGGGAACTGCGTTA 
YIGERCQYRDLKWW-ELR* 
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5 

D. Synthetic Platelet Factor 4 

1. Preparation of pBMll/NDP/PF4 (N-qene/DP/ 
Platelet Factor 4) 

In this construct , the synthetic PF4 gene is 
10 expressed as the C-terminal portion of a fusion protein 
having the first 32 amino acids of the N-gene at the N- 
terminus. An acid labile aspartic acid-proline 
dipeptide separates the two parts of the fusion. 

a. Preparation of 5kb Ncol pBMll plasmid 
15 fragment 

Plasmid pBMll/NDP/VGFA was digested with Ncol 
and the 5kb Nco l plasmid fragment was gel purified. 
This fragment has one Nco l overhang at the aspartic 
acid-proline cleavage site downstream of the sequences 
20 coding for the first 32 amino acids of the N-gene. The 
other Nco l site is in the neomycin resistance gene. 

b. Preparation of 0.6kb NcoI-BamHI pBMll 
plasmid fragment 

Plasmid pBMll/N/PF4 was digested with Nco l and 
25 BamHI and the 0.6kb NcoI-BamHI plasmid fragment was gel 
purified. This fragment has the Nco l overhang in the 
neomycin resistance gene. 

c. Ligation and isolation of pBMll/NDP/PF4 
The 5kb Nco l and the 0.6kb Nco I- Bam HI plasmid 

30 fragments were ligated with the PF4 gene using DNA 

ligase and the resulting mixture was used to transform 
competent HB101. The transf ormants were selected on 
neomycin such that only colonies with correctly re- 
constructed neomycin resistance genes would survive. 

35 Transf ormants were screened using restriction analysis 
with Nco l and nucleotide sequencing using the Sanger- 
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dideoxy technique. A correct construction was isolated 
and denoted pBMll/NDP/PF4 . 

The nucleotide sequence and corresponding 
amino acid sequence of the synthetic platelet factor 4 
5 gene in fusion downstream of the nucleotide sequences 
coding for the first 32 amino acids of the 
bacteriophage X N-gene and the acid labile dipeptide 
Asp-Pro (***) in the expression vector pBMll is as 
follows. 

10 



N-gene -> 





H 


D 


A 


Q 


T 


R 


R 


R 


E 


R 


R 


A 


E 


K 




ATG 


GAT 


GCA 


CAA 


ACA 


CGC 


CGC 


CGC 


GAA 


CGT 


CGC 


GCA 


GAG 


AAA 




Q 


A 


Q 


W 


K 


A 


A 


N 


P 


L 


L 


V 


G 


V 




CAG 


GCT 


CAA 


TGG 


AAA 


GCA 


GCA 


AAT 


CCC 


CTG 


TTG 


GTT 


GGG 


GTA 


15 














































*** 


*** 




PF4 


-> 








S 


A 


K 


P 


V 


R 


I 


D 


P 


M 


E 


A 


E 


E 




AGC 


GCA 


AAA 


CCA 


GTT 


CGG 


ATC 


GAT 


CCC 


ATG 


GAA 


GCT 


GAA 


GAG 




D 


G 


D 


L 


Q 


C 


L 


C 


V 


K 


T 


T 


S 


Q 




'GAT 


GGA 


GAT 


CTG 


CAA 


TGC 


CTG 


TGC 


GTT 


AAG 


ACT 


ACG 


TCT 


CAG 


20 


V 


R 


P 


R 


H 


I 


T 


S 


L 


E 


V 


I 


K 


A 




GTT 


AGA 


CCG 


CGG 


CAT 


ATC 


ACT 


AGC 


CTC 


GAG 


GTT 


ATC 


AAA 


GCG 




G 


P 


E 


C 


P 


T 


A 


Q 


L 


I 


A 


T 


L 


K 




GGC 


CCA 


CAC 


TGT 


CCG 


ACT 


GCG 


CAG 


CTG 


ATC 


GCG 


ACT 


CTG 


AAA 




N 


G 


R 


K 


I 


C 


L 


D 


L 


Q 


A 


P 


L 


Y 


25 


AAC 


GGC 


CGT 


AAA 


ATA 


TGT 


CTG 


GAT 


CTG 


CAG 


GCA 


CCG 


CTG 


TAC 




K 


K 


I 


I 


K 


K 


L 


L 


£ 


S 


*** 










AAG 


AAA 


ATC 


ATC 


AAA 


AAG 


CTT 


CTC 


GAG 


TCT 


TGA 









30 E. Oncostatin M 

1. Construction of pBM16/NDP/OncM 

a. Preparation of modified Oncostatin M gene 
fragment 

Plasmid pOncMW2 containing a modified Onco- 
35 statin M gene was digested with Ncol and BamHI and the 
700 bp Nco I- BamH I Oncostatin M gene was gel purified. 
This fragment contained the Ncol overhang at the 5 1 end 
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of the gene and the BamH I overhang at the 3* end of the 
gene. 

b. Ligation and isolation of pBM16/NDP/OncM 
The 700 bp NcoI-BamHI fragment of the modified 

5 Oncostatin M gene and the Nco I- BamH I fragment of the 
plasmid pBM16/NDP were ligated together with T4 ligase 
and transformed into competent HB101 E. coli. The 
transf ormants were screened for correct construction by 
nucleotide sequencing using the Sanger-dideoxy technique. 
10 A correct colony was chosen and termed pBM16/NDP/OncM. 

c. Preparation of Oncostatin M using 
pBM16/NDP/OncM 

E. coli HB101 strain harboring the plasmid 
pBM16/NDP/OncM f which encodes the first 32 amino acids 

15 of the bacteriophage x N-gene fused to an acid cleav- 
able dipeptide (DP)/ and the synthetic OncM gene was 
grown at 30°C. At an ODgOO °^ approximately 0.9 the 
temperature was raised to 42°C which inactivates Jbhe 
temperature sensitive repressor inducing the PL promo- - 

20 ter to allow transcription and translation of the NDP/ 
OncM fusion gene. The fusion protein is characterized 
by having the 32 amino-terminal residues of the bacter- 
iophage x N-gene followed by the acid labile dipeptide 
Asp-Pro followed by the 228 amino acids of Oncostatin M 

25 including its N-terminal methionine. 

2. Preparation of Oncostatin M Using pBMX 

An 82 bp fragment was synthesized chemically 
as shown below. 



30 



5' - CATGGCCATTGAAGGGCGCGCTGCGATCGGCAGCTGCTCGAAA 
3' - CGGTAACTTCCCGCGCGACGCTAGCCGTCGACGAGCTTT 

GAGTACCGCGTGCTCCTTGGCCAGCTCCAGAAGCAGACA - 3' 
CTCATGGCGCACGAGGAACCGGTCGAGGTCTTCGTCTGTCTAG - 5' 
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The 82 bp fragment and truncated OncM cDNA ( Bgl ll- 
Hindlll) isolated from pOncMW2 were cloned into the 
pBM16/NDP/TW vector prepared by Ncol and Hin di I I 
double digestion. The DNA was used to transform E. 
5 coli DH5a. The clone pBMX was isolated and confirmed 
to carry the predicted sequence. Since the coding se- 
quence contains the -I-E-G-R-, a factor X recognition 
site, precisely fused to the N-terminal of mature OncM, 
the recombinant protein produced by pBMX should be 
10 cleaved at the R residue of -I-E-G-R- to generate 
mature OncM with the authentic N-terminal sequence 
following treatment of activated factor X. Similar 
procedures were used to prepare Oncostatin M using pBMX 
as to prepare Oncostatin M from pBM16/NDP/OncM. 

15 

Example VI. 



20 



25 



30 
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Preparation of the Polypeptide of Interest 
as a Fusion Protein 
with the Alkaline Phosphatase Signal Secruence 

A. Preparation of pBMll/PAD/EGF 

Synthetic oligonucleotides were designed to 
allow insertion of DNA coding for a modified alkaline 
phosphatase signal peptide and a linker region with 3 
cloning sites ( Hin dlll, Sma l and BamH I) into the pBMll 
expression vector downstream of the P L promoter and N 
gene ribosomal binding site. The nucleotide sequence 
was optimized to be as similar as possible to the 
nucleotide sequence of the amino terminus of the lambda 
N gene as the lambda N gene sequence has evolved with 
that of its ribosomal binding site for efficient 
ribosome initiation and translation. In addition, the 
second amino acid of the alkaline phosphatase signal 
sequence, the basic amino acid lysine was changed to an 
acidic amino acid, aspartic acid. 
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1. Preparation of 0,17kb EcoR I-BamHI fragment of 
EGF 

Plasmid pBMll/NDP/EGP (30ug) was digested with 
30 units of Eco RI and then treated with 4 units of 
5 Klenow fragment of DNA polymerase to create blunt 

ends. The DNA was finally digested with 30 units of 
BamHI and the 0.17kb fragment of the EGF gene was 
recovered after electrophoresis on an agarose gel. The 
DNA so purified has a blunted EcoR I site at the 5' end 
10 and a BamH I overhang at the 3' end, 

2 . Preparation of 0.5kb Pvu I -Hind III fragment 
Of pBMll/PAD 

Plasmid pBMll/PAD (18ug) was digested with 30 
units of Hindlll and then treated with Klenow fragment 
15 to blunt the ends. The DNA was then digested with Pvu I 
and the 0.5kb Pvu l-Hindlll (blunt) fragment was 
recovered after electrophoresis on an agarose gel. 

3. Preparation of the 5.2kb Pvu I-BamHI fragment 
of pBMll/PAD 

20 Plasmid pBMll/PAD (18ug) was digested with 30 

units of Pvu l followed by 30 units of BamH I. The 5.2kb 
fragment was recovered after electrophoresis on an 
agarose gel. 

4. Ligation and isolation of pBMll/PAD/EGF 

25 The 0.17 kb EcoR I ( blunt ) -BamHI fragment , the 

0.5kb Pvul-Hindlll (blunt) fragment , and the 5.2kb 
Pvu I - BamH I fragment were ligated together and the 
resulting mixture was used to transform competent E. 
coli HB101. The transf ormants were serened using DNA 

30 sequencing, as described above. The desired signal 
sequence/EGF region had the following sequence: 



35 



Signal Sequence 

ATGGATCAATCTACAATCGCCCTCGCACTTCTCCCACTGCTGTTCACT 
MDQSTIALALLPLLFT 
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EGF 

CCAGTGACAAAAGCTAATTCTGACTCTGAATGCCCGCTGTCTCATGAC 
PVTKANSDSECPLSHD 



Nsil 

5 GGCTACTGCCTGCATGACGGCGTATGCATGTACATCGAAGCTCTG 
GYCLHDGVCMYIEAL 



SphI 

GACAAGTACGCATGCAACTGCGTTGTTGGCTACATCGGCGAACGT 
DKYACNCVVGYIGER 

10 

BamHI 

TGCCAGTACCGTGACCTGAAATGGTGGGAACTGCGTTAAGGATCC 
CQYRDLKWWELR* 



15 



20 



25 



30 



35 



The efficacy of the production of foreign 
protein in the pBMll/PAD expression system and the 
ability to purify functionally active foreign proteins 
from the fusion product has been show using 
pBMll/PAD/EGP as an example. After size exclusion 
chromatography (TSK-250), 10.3 mg of equivalents of 
active EGF fusion polypeptide was recovered from 23 g 
(8 liters) of E. coli derepressed to express the EGF 
gene. Forty percent of the EGF activity was derived 
from EDF cleaved from the signal sequence. 

B. Construction of pBMll/PAD/OncM 

1. Preparation of modified Oncostatin M gene 
fragment 

Plasmid p0ncM7V2 containing a modified Onco- 
statin M gene was digested with Ncol and the 5 1 over- 
hanging bases were removed by treatment with SI nucle- 
ase leaving the fragment blunt-ended. The nuclease 
treatment removed the codons for the initiating methio- 
nine. The plasmid was further digested wth BamHI and 
the 700 bp Ncol (blunt) -BamHI Oncostatin M gene fragment 
was gel purified. This fragment contained the Ncol 
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blunt end at the 5' end of the gene and BamHI overhang 
at the 3' end of the gene, 

2. Preparation of pBMHM3/PAD fragments 

Plasmid pBMHM3/PAD containing the nucleotide 
5 sequences coding for a modified alkaline phosphatase 
signal sequence was digested with Hindlll which cuts 
directly downstream of the signal sequence. The 
overhanging ends were filled in and made blunt using 
Klenow fragment of DNA polymerase. The resulting DNA 
10 was further digested with Pvul and the 680 bp 
Hin dlll (blunt )-PvuI fragment was gel purified. 

Plasmid pBMllM3/PAD was also digested with 
BamH I and Pvu l and the 5 kb BamHI-PvuI fragment was gel 
purified. 

15 3. Ligation and isolation of pBMll/PAD/OncM 

The 700 bp Ncol (blunt) -BamHI Oncostatin M gene 
fragment/ the 680 bp Hindlll (blunt ) -Pvul fragment of 
pBMHM3/PAD and the 5 kb BamH I - Pvu l fragment of 
pBMHM3/PAD were ligated together using T4 ligase and 

20 transformed into competent HB101 E. coli. Correct con- 
struction was assayed by nucleotide sequencing using 
the Sanger-dideoxy technique. A correct colony was 
chosen and designated pBMll/PAD/OncM. 

25 C. Preparation of pBMl 1 /P AD/nVGFa 

Synthetic oligonucleotides were designed to 
link the VGPa synthetic gene with an alkaline phospha- 
tase modified signal sequence to provide for an optimal 
signal sequence cleavage site by coding for the addi- 

30 tional N-terminal residues occurring immediately down- 
stream of the signal sequence cleavage site in the 
natural VGF, denoted extreme N-terminus above. The 
nVGFa sequence contains the altered sequences GTC and 
GYACRC instead of the natural VGF sequences GDC and 

35 GMYCRC and terminates with the sequence YQR upstream of 
the natural sequence PNT. In this expression system, 
for the majority of the molecules, the signal sequence 
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remains attached to the nVGFa forming a fusion protein 
with nVGFa at the C-terminus. 

1. Preparation of O.Skb Hindlll-Pvul digested 
pBMll/PAD 

5 Plasmid pBMll/PAD was digested with Hindlll 

and Pvul and the O.Skb fragment was gel purified. The 
Hindlll site is located at the C-terminus of the modi- 
fied alkaline phosphatase signal sequence. 

2. Preparation of the 5.2kb PvuI-BamHI pBMll * 
10 plasmid fragment 

Plasmid pBMll/NDP/VGFa was digested with Pvu l 
and BamH I and the 5.2kb plasmid fragment was gel 
purified. 

3. Preparation of the 170bp Ncol (blunt ) -BamHI 
15 . synthetic VGFa gene 

Plasmid pBMll/NDP/VGFa was digested with Ncol 
and the 5* overhangs were removed by treatment with Sl- 
nuclease. This created a blunt end at the first codon 
of the VGFa truncated synthetic gene. The DNA was then 
20 digested with BamHI and the 170bp Ncol (blunt) -BamHI 
fragment was gel purified. 

4. Ligation and isolation of pBMl 1 /P AD/n VGFa 
Oligonucleotides VGF105 and 106 , the O.Skb 

Hindlll-Pvul fragment of pBMll/PAD, the 5.2kb Pvul- 
25 BamH I pBMll fragment and the 170bp Ncol ( blunt ) -BamHI 
synthetic VGFa gene were ligated together using DNA 
ligase and the resulting mixture was used to transform 
competent HB101. The transf ormants were selected on 
neomycin and screened by restriction analysis and 
30 nucleotide sequencing using the Sanger-dideoxy 

technique. A correct construct was isolated containing 
the modified alkaline phosphatase signal sequence in 
frame with the nVGFa gene. 
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Signal Sequence * 

ATGGATCAATCTACAATCGCCCTCGCACTTCTCCCACTGCTGTTCACTCCAGTGA 
MDQSTIALALLPLLFTPVT 



nVGF * 

5 CAAAAGCTGACTCTGGTAACGCTATCGAAACTACTTCTCCGGAAATCACTAACGC 
KADSGNAIETTSPEITNA 



TACTACTGACATCCCGGCTATCCGTCTGTGCGGCCCGGAAGGCGACGGCTACTGC 
TTDIP AIRLCGPEGDGYC 



10 



Kpn l Sph I . 

CTGCATGGTACCTGCATCCATGCACGTGACATCGACGGCTACGCATGCCGTTGCT 
LHGTCIHARDIDGYACRCS 



EcoRI 

CTCATGGCTACACTGGAATTCGTTGCCAGCATGTTGTTCTGGTCGACTACCAGCG 
15 HGYTGIRCQHVVLVDYQR 



BamHI 
TTAAGGATCC 
Ter 

20 

D. Preparation of pBMll/PAD/PF4 (Signal sequence of 
alkaline phosphatase with Asp as residue 2 instead 
of Lys/Platelet Factor 4) , 

The nucleotide sequence and corresponding 
25 amino acid sequence of the synthetic platelet factor 4 
gene in fusion downstream of the nucleotide sequences 
coding for a modified alkaline phosphatase signal 
peptide were prepared essentially as described above 
for pBMll/PAD/nVGFa, except that the synthetic PF4 gene 
30 was used instead of the synthetic VGFa gene in Step 
3. The construct isolated is as follows. Predicted 
cleavage site is noted with (***). 



35 
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Signal sequence 

M D Q S 
ATG GAT CAA TCT 


-> 
T 
ACA 


I 

ATC 


A 
GCC 


L 
CTC 


A 
GCA 


L 
CTT 


L 
CTC 


P 
CCA 


L 
CTG 


L 
CTG 


5 


P 
TTC 


T 
ACT 


p 
CCA 


v 

GTG 


T 
ACA 


K 
AAA 


*** 

A 
GCT 


PF4 

E 
GAA 


-> 
A 
GCT 


E 
GAA 


£ 
GAG 


D 
GAT 


G 
GGA 


D 
GAT 




L 
CTG 


Q 
CAA 


C 

TGC 


L 
CTG 


C 
TGC 


V 
GTT 


K 
AAG 


T 
ACT 


T 
ACG 


S 
TCT 


Q 
CAG 


V 
GTT 


R 
AGA 


P 
CCG 


10 


R 
CGG 


H 
CAT 


I 
ATC 


T 
ACT 


s 

AGC 


L 
CTC 


E 
GAG 


v 

GTT 


I 
ATC 


K 
AAA 


A 
GCG 


G 
GGC 


p 
CCA 


CAC 




C 
TGT 


P 
CCG 


T 
ACT 


A 
GCG 


Q 
CAG 


L 
CTG 


• I 
ATC 


A 
GCG 


T 
ACT 


L 
CTG 


K 
AAA 


N 
AAC 


G 
GGC 


R 
CGT 




K 
AAA 


I 
ATA 


C 
TGT 


L 
CTG 


D 
GAT 


L 
CTG 


Q 
CAG 


A 
GCA 


P 
CCG 


L 
CTG 


y 

TAC 


K 
AAG 


K 
AAA 


I 
ATC 


15 


I 
ATC 


K 
AAA 


K 
AAG 


L 
CTT 


L 
CTC 


£ 
GAG 


S 
TCT 


*** 
TGA 
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Example VII. 

20 

Expression of the Polypeptide of Interest 
as a Fusion Protein 
with the Alkaline Phosphatase Signal Sequence 

A. Preparation of pBMll/PAK/nVGFa (Alkaline 
25 phosphatase signal seguence/nVGFa with natural VGF 

N- terminus and sequences GTC and GYACRC) 

Plasmid pBMll/PAD/nVGF was mutagenized in 
vitro (Morinaga et al. , Biotechnology (1984) 2:636-643) 
to alter the codons coding for the second amino acid in 
the signal sequence, namely to change the Asp (D) codon 
to that for Lys (K) the residue found in the natural 
sequence. This mutagenesis also introduced a Pvul site 
into the signal sequence. 



35 
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Signal Sequence * 

ATGGATCAATCTACAATCGCCCTCGCACTTCTCCCACTGCTGTTCACTCCAGTGACAAAA 
MDQSTIAL ALLPLLFTPVTK 



nVGP * 

5 GCTGACTCTGGTAACGCTATCGAAACTACTTCTCCGGAAATCACTAACGCTACTACT 
ADSGNAIETTSPEITNATT 



Kpn l 

GACATCCCGGCTATCCGTCTGTGCGGCCCGGAAGGCGACGGCTACTGCCTGCATGGT 
DIPAIRLCGPEGDGYCLHG 



10 



Sph I 

ACCTGCATCCATGCACGTGACATCGACGGCTACGCATGCCGTTGCTCTCATGGCTACACT 
TCIHARDIDGYACRCSHGYT 



Eco RI Bam HI 
1 5 GGAATTCGTTGCCAGCATGTTGTTCTGGTCGACTACCAGCGTTAAGGATCC 
G I RCQHVVLVDY QR Ter 



B. Preparation of pBMll/PAK/EGF 

In this expression cassette the EGP gene is 
part of a fusion with, the alkaline phosphatase signal 
sequence. 

Plasmid pBMll/PAD/EGF was mutagenized in vitro. 
to alter the codons coding for the second amino acid, in 
2 5 the signal sequence, to change the Asp (D) codon to 

that for Lys (K) the residue found in the natural se- 
quence. This mutagenesis also introduced a - Pvu l site 
into the signal sequence. 



20 



30 



Pvu l 

Signal Sequence - 

ATGAAACAATCTACGATCGCCCTCGCACTTCTCCCACTGCTGTTCACTCCAGTGA 
MKQSTIALALLPLLPTPVT 



35 



• EGF * 

CAAAAGCTAATTCTGACTCTGAATGCCCGCTGTCTCATGACGGCTACTGCCTGCA 
KANSDSECPLSEDGYCLH 
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Nsil Sph I 
TGACGGCGTATGCATGTACATCGAAGCTCTGGACAAGTACGCATGCAACTGCGTT 
DGVCMYIEALDKYACNCV 



5 GTTGGCTACATCGGCGAACGTTGCCAGTACCGTGACCTGAAATGGTGGGAACTGC 
VGYIGERCQY RDLKWWELR 

BamHI 

GTTAAGGATCC 
* 

10 

C. Preparation of TacPak/EGF (alkaline phosphatase 
signal secruence/human EGF ) 
1. Preparation of Plasmid Fragments 
15 Plasmid pl35-l was derived from plasmid pDR540 

(Pharmacia) and contained the Cro gene SD and a Bgl ll 
site downstream of the lac SD. pDR540 is an expression 
vector containing the trp-lac hybrid promoter • pl35-l 
was digested with Bgl ll and BamH I and treated 'with 
20 bacterial alkaline phosphatase . 

Plasmid pBMll/PAK/EGF was digested with PvuII 
and BamH I and the -230 bp fragment coding for part of 
the alkaline phosphatase signal sequence and human EGF 
was isolated. 
25 2. preparation of TacPakl and TacPak2 

Oligonucleotides 

Synthetic oligonucleotides TacPakl and TacPak2 
were designed with an overhang , compatible with the 
Bglll site of pl35-l and a PvuII overhand, compatible 
30 with the PvuII site in the alkaline phosphatase/EGF 
Pvu II/ BamH I fragment. The oligonucleotides were 
synthesized on an Applied Biosystems Oligonucleotide 
Synthesizer. 
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Ball I Pvul 

~T~ T 

TacPakl 5' GATCTATGAAACAATCTACGAT 3' 
TacPak2 3' ATACTTTGTTAGATGC 5' 



3. Ligation and Isolation of TacPak/EGF Clone 
The Bglll-BamHI digested pl35-l, the 230 bp 
PAK/EGF fragment and oligonucleotides TacPakl and 
TacPak2 were ligated using DNA ligase, transformed into 
10 competent HB101 and a correct construct was isolated by 
DNA sequencing. 



. (Hindlll site of pDR540) 
15 AAGCTTACTCCC 



trp-35 (16bp) lac-10 

CATCCCCCTG [TTGACA] ATTAATCATCGGCTCG (TATAATG) 

20 mRNA 5' lad binding site lacSD 

TGTGG/AATTGTG AG CGG ATAACAATTT CACAC {AGGA} AACAGGATCACTA 



Pvu l 

croSD (llbp)Bglll 
{AGGA} GGTTCAGATCT 



25 



Signal Sequence -> 

ATGAAACAATCTACGATCGCCCTCGCACTTCTCCCACTGCTGTTCACTCCAGTGA 
MKQSTIALALLPLLF-TPVT 



EGP -> 

3 0 CAAAAGCTAATTCTGACTCTGAATGCCCGCTGTCTCATGACGGCTACTGCCTGCA 
KANSDSECPLSHDGYCLH 



Nsil Sph I 
TGACGGCGTATGCATGTACATCGAAGCTCTGGACAAGTACGCATGCAACTGCGTT 
DGVCMY. IEALDKYACNCV 



35 



GTTGGCTACATCGGCGAACGTTGCCAGTACCGTGACCTGAAATGGTGGGAACTGC 
VGYIGERCQYRDLKWWELR 
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BamH I 
GTTAAGGATCC 



Example VIII. 

Expression of a Polypeptide of Interest as a Fusion 
Protein with the Alkaline .Phosphatase Signal Sequence 
Using an Expression Cassette 
10 Comprising a Transcriptional Termination Region 

A. Preparation of pTCPt/EGF ( [trp-35]16bp[lac- 
101 ClacSD]llbp[ATG] /alkaline phosphatase 
signal/human EGF/trans. term.-NEO ) 
15 This plasmid is designed to have the tac 

promoter elements and utilize the cro SD to express 
human EGF behind the alkaline phosphatase signal 
sequence. It has a pBR322 background with the Neomycin 
resistance gene. 
20 1# Preparation of the 420bp Hindlll (blunt) -BamHI 

fragment of TacPak/EGF 

TacPak/EGF was digested with Hindlll and then 
treated with the Klenow fragment of DNA polymerase to 
create blunt ends. The DNA was then digested with 
25 BamH I and the 420bp fragment containing the tac 
promoter elements and the coding region for the 
alkaline phosphatase signal sequence and human EGF was 
isolated by agarose gel electrophoresis. 

2- Preparation of the 2.8kb EcoRK blunt) -BamHI 
30 fragment of pBMl 6 t/NDP/VGFa 

pBM16 t/NDP/VGFa was digested with EcoRI and 
then treated with Klenow to create blunt ends. The DNA 
was then digested with BamHI and the 2.8 kb fragment 
was isolated. This DNA fragment contains the pBR322 
35 origin, the neomycin resistance gene with its Ncol site 
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V 



10 



20 



25 



30 



35 



removed, and the gene32-like transcription terminator 
downstream of the BamH I site. 

3. Ligation and Isolation of pTCPt/EGF 

The 2.8kb EcoRI ( blunt ) -BamHI fragment of 
pBMl 6 1 /NDP/VGFa was ligated to the 420bp 
Hind lll ( blunt ) -BamH I fragment of TacPak/EGF and the 
resulting DNA was used to transform competent 
JM109(lacIq) . A correct construct was isolated by its 
resistance to neomycin and by DNA sequencing. 



(Hindi II site of pDR540) 
AAGCTTACTCCC 

15 trp-35 (16bp) lac-10 

CATCCCCCTG [TTGACA] ATTAATCATCGGCTCG ( TATAATG ) 



mRNA 5' lad binding site lacSO 
TGTGG /AATTGTG AGCGGATAACAATTTCACAC {AGGA} AACAGGATCACTA 



Pvul 

croSO (llbp)Bglll Signal Sequence -> 

{AGGA} GGTTCAGATCT ATGAAACAATCTACGATCGCCCTCGCACTTCTCC 

MKQSTIALALLP 



EGF -> 

CACTGCTGTTCACTCCAGTGACAAAAGCTAATTCTGACTCTGAATGCCCGCTGTC 
LLFTPVTKANSDSECPLS 



Nsil 

TCATGACGGCTACTGCCTGCATGACGGCGTATGCATGTACATCGAAGCTCTGGAC 
HDGYCLHDGVCMY I E A X D 



Sph I 

AAGTACGCATGCAACTGCGTTGTTGGCTACATCGGC 
KYACNCVVGYIG 



Bam HI 

GAACGTTGCCAGTACCGTGACCTGAAATGGTGGGAACTGCGTTAAGGATCCGTGA 
E.RCQYR DLKWWELR* 
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Trans. Term. 
CTAATTGGGGACCCTAGAGGTCCCCTTTTTTATTTTAAAACGATC 



10 



15 



20 



25 



30 



B. Preparation of pTCPt/nVGFa ( [trp-35]16bpf lac-10] 
ClacSD] [croSD]llbp[ATG]/ alkaline phosphatase 
signal/n-terminal VGFa with sequence GTC and 
GYACRC) /trans. term.-NEO ) 

This plasmid has the tac promoter elements and 
uses the cro SD to express the modified VGF gene with 
the N-terminal extension downstream of the alkaline 
phosphatase signal sequence. The plasmid has a pBR322 
background with the neomycin resistance gene. 

1« Preparation of the 350bp PvuI-BamHI fragment 
Of pBMll/PAK/nVGFa 

Plasmid pBMll/PAK/nVGFa was digested with Pvul 
and BamH I and the 350bp fragment was isolated by gel 
electrophoresis. This fragment contains most of the 
alkaline phosphatase signal sequence and the nVGFa 
gene. 

2. Preparation of the 2.8kb PvuI-BamHI fragment 
of pTCPt/EGF 

Plasmid pTCPt/EGF was- digested with Pvu l and 
BamH I and the 2.8kb fragment was isolated by gel 
electrophoresis . 

3. Ligation and Isolation o£ pTCPt/nVGFa 

The 2.8kb fragment and the 350bp fragment were 
ligated using DNA ligase and the DNA was used to 
transform competent JM109 (laclq) . A correct construct 
was isolated using restriction analysis. 



(Hindi! I site of pDR540) 
35 AAGCTTACTCCC 
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trp-35 (16bp) lac-10 

CATCCCCCTG [TTGACAJ ATTAATCATCGGCTCG (TATAATG) 



5 



10 



15 



20 



25 



tnRNA 5' lad binding site lacSD 
TGTGG /AATTGTG AGCGGATAACAATTTCACAC {AGGA} AACAGGATCACTA 



Pvul 

croSD (llbp)Bglll Signal Sequence -> 

{AGGA} GGTTCAGATCT ATGAAACAATCTACGATCGCCCTCGCACTTCTCCC 

MKQSTIALALLP 



nVGP -> 

ACTGCTGTTCACTCCAGTGACAAAAGCTGACTCTGGTAACGCTATCGAAACTACT 
LLFTPVTKADSGNAIETT 



TCTCCGGAAATCACTAACGCTACTACTGACATCCCGGCTATCCGTCTGTGCGGCC 
SPEITNATTDIPAIRLCGP 



Kpn l 

CGGAAGGCGACGGCTACTGCCTGCATGGTACCTGCATCCATGCACGTGACATCGA 
EGDGYCLHGTCIHARDID 



Sph I EcoR I 
CGGCTACGCATGCCGTTGCTCTCATGGCTACACTGGAATTCGTTGCCAGCATGTT 
GYACRCSHGYTGIRCQHV 



BamHI Trans . 

GTTCTGGTCGACTACCAGCGTTAAGGATCCGTGACTAATTGGGGACCCTAGAGGT 
VLVDYQR* 



Term. 

CCCCTTTTTTATTTTAAAACGATC 



30 C. Preparation of pTNPt/EGF ( [trp-35 ]17bp[ lac- 
10 ] (nSD]8bp[ATG 1 /alkaline phosphatase signal/human 
EGF/trans. term.-NEO ) 

This plasmid is designed to have the tac 
promoter elements and utilize the N-gene SD to express 
35 human EGF behind the alkaline phosphatase signal 

sequence. It has a pBR322 background with the Neomycin 
resistance gene. 
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1. Preparation of 2.8kb Pvul -BamHI pTNPt 

Plasmid pTNPt was digested with Pvul and BamH I 
and the 2.8kb fragment was isolated by gel 
electrophoresis . 
5 2. Preparation of 300bp PvuI-BamHI fragment of 

pBMl 1 /P AK/EGF 

Plasmid pBMll/PAK/EGP was digested with Pvu l 
and Bam HI and the 300bp fragment was isolated, 
3. Ligation and Isolation of pTNPt/EGF 
10 The 2.8kb fragment and the 300bp fragment were 

ligated using DNA ligase and the DNA was transformed 
into competent JM109 (Laclq) . A correct construct was 
isolated by restriction analysis and by DNA sequencing. 



15 



(EcoRI site of pBMll) 

GAATTACTCCCCATCC 



Sst I 

20 • trp-35 (17bp) lac-10 5' lac 

CCCTG [TTGACA] ATTAATCATCGAGCTCG (TATAATG) TGTGG/AATTG 



Bsm I 

mRNA-> n mRNA-> 

TGTGAGCGGATAACAATTTCACACAGCATTCAAAGCAGAAGGCTTTGGGGTGTGT 



25 



GATACGAAACGAAGCATTGGCCGTAAGTGCGATTCCGGATTAGCTGCCAATGTGC 
CAATCGCGGGGGGTTTTCGTTCAGGACTACAACTGCCACACACCACCAAAGCTAA 



30 Pvu l 
nSD (8bp) Signal Sequence -> 

CTGAC {AGGA} GAATCCAG ATGAAACAATCTACGATCGCCCTCGCACTTCTC 

MKQSTIALALL 



35 



EGF -> 

CCACTGCTGTTCACTCCAGTGACAAAAGCTAATTCTGACTCTGAATGCCCGCTGT 
PLLFTPVTKANSDSECPLS 
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Nsil 

CTCATGACGGCTACTGCCTGCATGACGGCGTATGCATGTACATCGAAGCTCTGGA 
HDGYCLHDGVCMYIEALD 



Sph I 

5 CAAGTACGCATGCAACTGCGTTGTTGGCTACATCGGCGAACGTTGCCAGTACCGT 
KYACNCVVGYIGERCQYR 



10 



BamHI BamHI 
GACCTGAAATGGTGGGAACTGCGTTAAGGATCCGTGACTAATTGGGGA 

DLKWWELR* 



(BamHI site of pBMll) 

Trans. Term. I 
CCCTAGAGGTCCCCTTTTTTATTTTAAAACGATCC 



15 Example IX. 

Isolation of Recombinant Polypeptides 

A. Growth Factors Produced in pBM-Based Vectors Using 
the PL Promoter and the ts CI Repressor 
20 E. coli B (HB101) containing the p3Mll/NDP/ 

growth factor plasmids were grown in Luria Broth at 
30°C. The density of the culture was measured at 
550 nm and when the density reached an absorbance of 
0.7 to 0.9 f synthesis of the growth factor fusion 
25 protein was induced by increasing the temperature to 
42°C. The culture was incubated at this temperature 
for 5-20 hrs, then the bacteria were isolated by cen- 
trifugation and frozen at -70°C until use. 

For isolation of the recombinant protein', the 
cells were thawed into buffer containing 0.05 M NaH 2 P0 4 
pH 7.2, 0.5 M NaCl, 0.01 M EDTA. One hundred fifty ml 
of buffer was used for a preparation from 50 g bacteria, 
The cells were disrupted by sonication on ice for 15 
min using a 1/4-inch probe, 50% pulse at SO watts of 
power. Following disruption of the cells, the insolu- 
ble protein was collected by centrifugation at 12,000 



30 



35 
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rpm in a GSA rotor for 90 min. The pellet containing 
the insoluble protein was then resuspended in 50 ml of 
6 M guanidine hydrochloride. The insoluble material was 
collected by centrifugation for 2 hrs at 25 , 000 rpm in a 
5 Beckman-type 30 ultracentrifuge rotor. The supernatant 
was collected and stored at -20°C until further use. 

Purification of the fusion protein was carried 
out on either a Sephacryl S300 or Practogel HW-55 column 
equilibrated with 1 M guanidine hydrochloride. Fractions 

10 containing the fusion protein were identified as those 
fractions containing a polypeptide having a molecular 
weight consistent with the molecular weight of the poly- 
peptide encoded by the synthetic gene as determined on a 
15% polyacrylamide-urea gel. 

15 - To obtain an active form of the recombinant 

growth factor , the fusion protein was allowed to refold 
by incubating it in 50 mM Tris-HCl buffer , pH 8.7, con- 
taining 1 M guanidine hydrochloride, 1.25 mM reduced 
glutathione, and 0.25 mM oxidized glutathione at 4°C for 

20 3-10 days. The biological activity of the growth factor 
was monitored by a competitive receptor binding assay as 
described above (see Example I.C). When a maximum level 
of activity was obtained, the protein was dialyzed against 
distilled water and lyophilized to dryness. 

25 If it was desired to remove the leader 

sequence, the protein was cleaved either by 
resuspending in 70% formic acid and incubating at 40 °C 
for 3 days or by incubating overnight at room 
temperature in a 100-fold molar excess of cyanogen 

30 bromide. The cleaved product was dialyzed against dis- 
tilled water and lyophilized to dryness. 

To further purify the recombinant growth 
factor, the growth factor was resuspended in 40% 
acetonitrile, 0.1% TFA and purified by HPLC using a 

35 Biofiad^ISK-250 column. Fractions containing the growth 
factor were pooled and further purified using reversed- 
phase HPLC, either Waters vBondapak C-18 or Rainin 
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Dynaraax C-8. The eluant was a linear gradient of 20- 
40% acetonitrile containing 0.1% TFA. Fractions 
containing receptor binding activity were pooled, 
lyophilized and stored at -20°C until use. 
5 1. TGF and Modified TGF 

a. N/TGF 

Recombinant modified human TGF was produced 
from plasmid pBMll/N/TGF and contained 33 amino acids 
of the N-gene at the N-terminus and the sequence 
10 modification QEEK instead of the natural human sequence 
QEDK. 

2. Modified and Truncated VGF 

a. PAD/nVGFa 

Recombinant modified VGF was produced from 
15 plasmid pBMll/PAD/nVGFa containing the extreme N- 

terminal sequence of VGF and the modified sequences GTC 
and GYACRC instead of the natural VGF sequence GDC and 
GMYCRC. The nVGFA fragment was expressed as a fusion 
protein with a modified alkaline phosphatase signal 
20 sequence at the N-terminus and was truncated at the 
sequence YQR at the C-terminus. 

b. NDP/VGFa 

Recombinant modified VGF was produced from 
plasmid pBMll/NDP/VGFa beginning at the DIPAIR sequence 

25 and ending at the YKQR sequence in- VGF. It has the 
modified sequences GTC and GYACRC instead of the 
natural VGF sequence GDC and GMYCRC. The VGFa fragment 
was expressed as a fusion protein with 32 amino acids 
of the N-gene at the N-terminus and the acid labile di- 

30 peptide aspartic acid-proline. 

c. VGFa 

The VGF fragment was prepared as described in 
2.b above and, after cleavage from the fusion protein 
by acid treatment, was subsequently further purified by 
35 HPLC. 
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d. NDP/VGFA 
Recombinant modified VGF was produced from 
plasmid pBMll/NDP/VGFA beginning at the DIPAIR sequence 
and ending at the YKQR sequence in VGF and having the 
5 modified sequences GTC and GYACVC instead of the 

natural VGF sequence GDC and GMYCRC . . The VGFA fragment 
was expressed as a fusion protein with 32 amino acids 
of the N-gene at the N-terminus and the acid labile 
dipeptide aspartic acid-proline. 
10 3. Chimeric TGF/VGF Hybrids 

a, N/TTV (TGF/TGF/VGF) 

Recombinant modified TTV was produced from 
pBMll/N/TTV and contained the amino acid sequence of 
human TGF in the amino terminal two-thirds of the gene 

15 with the exception of the sequence QEEK which was 
altered from the natural human sequence QEDK. The 
car boxy terminus was derived from the amino acid 
sequence of VGF and terminated with the sequence YQR 
upstream of the natural sequence PNT. The TTV fragment 

20 was expressed as a fusion protein with 33 amino acids 
of the N-gene at the N-terminus. 

b. NDP/TTV 

Recombinant TTV was produced from plasmid 
pBMll/N/TTV and modified as described in (a) except 
25 that the TTV fragment was expressed as a fusion protein 
with 32 amino acids of the N-gene at the N-terminus and 
the acid labile dipeptide aspartic acid-proline. 
C. • NDP/VTV 
Recombinant modified VTV was produced from 
30 plasmid pBMll/NDP/VTV and contained the amino acid 
sequence of human TGF in the middle domain with the 
amino acid sequence QEEK replacing the natural sequence 
QEDK. The N-terminal and C-terminal domains were 
derived from the truncated VGF sequence and begin with 
35 the sequence DIPAIR and end with the sequence YQR. The 
VTV fragment was expressed as a fusion protein with 32 
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amino acids of the N-gene at the N-terminus and the 
acid labile dipeptide aspartic acid-proline. 
d. NDP/TW 
Recombinant modified VTV was produced from 
5 plasmid pBMll/NDP/TW and contained the amino acid 

sequence of human TGF in the N-terminal domain of the 
gene. The middle and C-terminal domains were derived 
from the truncated VGP sequence and end with the 
sequence YQR. In addition, the synthetic gene has the 
10 modification GYACVC for GMYCRC. The TW fragment was 
expressed as a fusion protein with 32 amino acids of 
the N-gene at .the N-terminus and the acid labile 
dipeptide aspartic acid-proline. 

15 B. Growth Factors Produced in Vectors Comprising the 
tac or lac Promoters 

Bacterial hosts containing expression 
cassettes which comprise the tac or lac promoters were 
grown at 30 to 37°C to an optical density of A600 = 0.2 
20 to 0.8 in either LB broth or a chemcially defined 

medium such as M9 medium supplemented with thiamine and 
glucose. Am appropriate antibiotic was included in the 
growth medium to select for hosts containing the 
expression cassette. The bacterial cultures were 
25 induced with 100 to 1000 mM concentrations of IPTG and 
were allowed to grow at 30°C for 16 to 24 hours. For 
the expression cassettes lacking a lad gene the 
bacterial hosts carried an F-factor with thfe laclq 
gene, such as JM109, XL1, JM103, etc. For expression 
cassettes which carry the lad gene, examples of 
bacterial hosts are HB101, DH1, OH5, etc. In the case 
where the bacterial host has a functional lac operon 
(lac+) f the expression cassette can be induced with 1% 
lactose. After the induction period, growth factors 
can be isolated from either the medium or the cell 
pellet . 
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1. PAK/EGF 

Human EGF produced from the expression 
cassettes TacPak/EGP and pTcCPt/EGF was isolated from 
the medium in an active form with the alkaline 
5 phosphatase signal sequence removed. Approximately 85% 
of the active EGF is found in the medium, with the 
remainder associated with the call pellet. These 
expression cassettes have yielded 4 mg/1 of active 
EGF. The cells are removed from the medium by 

10 centrifugation and the medium is passed through an 

Amicon SY30, 30/000 M r cutoff spiral filter and then 
passed through a Q-Sepharose column and the highly 
purified human EGF was eluted in 20 mtt NaP0 4 pH7 with a 
0 to 0.5 M NaCl gradient. Alternatively, the growth 

15 factors can be isolated from the cell pellet by osmotic 
shock or sonication and purified by essentially the 
same procedure. 

2 . PAK/nVGFa 

Recombinant nVGFa was produced from the 
20 expression cassette pTCPt/nVGFa. The nVGFa was 

isolated from the cell pellet by sonication and was 
shown to constitute approximately 40% of the total 
bacterial protein. 

25 C. Platelet Factor 4 

Recombinant Platelet Factor 4 was isolated 
essentially as described above for growth factors. 

1. N/PF4 

Recombinant Platelet Factor 4 was produced 
30 from pBMll/N/PF4 as a fusion with the 33 N-terminal 
amino acids of the N-protein. 

2. NDP/PF4 

Recombinant Platelet Factor 4 was produced 
from pBMll/NDP/PF4 as a fusion with the 33 N-terminal 
35 amino acids of the N-protein and an aspartic-acid- 

proline cleavage site. Treatment of the fusion protein* 
with formic acid released the mature PF4. 
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D. Oncostatin M 

* 1. NDP /Oncostatin M 

a. Purification of Recombinant Oncostatin M 
5 Recombinant OncM fusion proteins were purified 

from E. coli as follows: A cell pellet from a 500 ml 
culture of E. coli was suspended in 40 ml of PNE buffer 
(0.5 M NaClr 10 mM EDTA, 50 mM sodium phosphate), and 
lysed by sonication. Aggregated proteins were collec-. 

10 ted from the cell lysate by centrifugation. Aggregated 
proteins were sequentially extracted for 16 hrs each 
with 120 ml of 8 M urea solutions buffered as follows: 
Solution 1) 20 mM Tris, pH 5; Solution 2) 20 mM Tris, 
pH 8; Solution 3) 50 mM Tris, pH 11, Most aggregated 

15 proteins were solubilized by Solutions 1 and 2, while 
recombinant Onco M remained insoluble until treatment 
with Solution 3. 

b. Refolding of Recombinant Molecules 

The Solution 3 extract was then dialyzed for 
20 24 hrs against a refolding buffer (1 M guanidine HC1, 
1.2 mM oxidized glutathione, 0.2 mM reduced glutathio- 
nine, 20 mM Tris EC1, pH 8.0-9.0). Lowering the pH to 
<pH 8.0 resulted in a 100-fold reduction in yield of 
biologically active OncM. Following re-folding, pro- 
25 teins were dialyzed versus 1 N acetic acid before 

testing in the growth inhibitor assay (Example I.E). 
2. PAD/Oncostatin M 

The PAD/OncoM was prodduced from the 
expression cassette pBMll/PAD/OncuM. The medium was 
30 tested for active OncoM and was shown to contain "from 
20 to 500 yg/1. 
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Example X. 

Biological Activity of Recombinant Growth Factors 
Prepared in Procaryotic Cells 



10 



A. EGF Receptor Binding 

This assay determines the ability of a molecule 
to bind to the EGF receptor as measured by its ability to 
inhibit the binding of EGF to its receptor. All growth 
factors and chimeric growth factors, whether modified or 
truncated, isolated to date were active in the EGF 
receptor binding inhibition assay. A summary of these 
results is shown below: 
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Table 

EGF Receptor Binding of Recombinant Growth Factors 



20 



25 



30 



35 



Peptide 

N/TGF - modified 
truncated fusion 

PAD/nVGFa - 
modified truncated 
fusion 

NDP/VGFa - 
modified truncated 
fusion 

NDP/VGFA - modified 
truncated fusion 

N/TTV - modified 
truncated chimeric 
fusion 

NDP/TTV - modified 
truncated chimeric 
fusion 



Expression 
Cassette 



pBMll/N/TGF 



Binds 
to EGF 
Purity Receptor 



pBMll/N/TTV 



pBMll/NDP/TTV 



95% 



pBMll/PAD/nVGFa >9 5 % 



pBMll/NDP/VGFa >95% 



pBMll/NDP/VGFA >95% 



>95% 



>95% 



Yes 



Yes 



Yes 



Yes 



Yes 



Yes 
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NDir/VxV 

truncated 
rusion 


muuiixcu 
chimeric 




>95% 


Yes 


NxJir/XVV 

truncated 
fusion 


QlUQli lcU 

chimeric 


n DM1 1 /MHO /TW 


J W ^ 


Yes 


PAK/EGF 




pUMX X / rAJX/ Ci\a£ 


7 


Yes 

X C 2 


EGF 




poMXX/ rt\J\/ Lur 




Yes 

X c s 


PAD/EGF 




poMX 1 / FAD/ £iU£ 




X 


EGF 




pnMXX/ rAV/ rAa£ 


7 3 7 


X CO 


PAK/EGF 




TacPak/EGF 


95% 


Yes 


EGF 




TacPaJc/EGF 


95% 


Yes 


PAK/EGF 




pTCPt/EGF 


95% 


Yes 


EGF 




pTCPt/EGF 


>95% 


Yes 



20 

A comparison of the binding inhibition curves 
for natural mouse EGF and the bacterially expressed re- 
combinant chimeric growth factor N/TTV (a polypeptide 
fusion of the 32 N-terminal amino acids of the lambda 
25 N-gene and the modified and truncated TGF/VGF hybrid) 

suggested that there were no differences in the binding 
activity. 

B. Mitogenic Activity 
3Q The activity of several of the purified -growth 

# factors was tested and the activity determined in all 

cases was comparable to the effect caused by EGF. The 
, compounds tested are as indicated below: 



35 
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Table 



Mitoqenic Activity of Growth Factors 



5 



Peptide 



Mitogenic 
Activity * 



TTV - modified truncated chimeric 



Yes 



10 



N/TTV - modified truncated 
chimeric fusion 



Yes 



* As measured by 3 H-thymidine or 125 I-IdU 
incorporation. 



15 

C. Wound Healing 

1. Mid-dermal Injuries 

The effect of natural or synthetic TGF, EGF 
an*d VGF, as well as recombinant growth factors on mid- 
20 dermal injuries was assessed as described in Example 
VE1. The percent of the original burn area which had 
healed was measured by computer-assisted telemetry , and 
the percent wound re-epithelialization was 
determined. Untreated wounds were approximately 15% 
25 reepithelialized. Treatment with Silvadene alone or 
Silvadene with EGF resulted in approximately 50% 
reepithelialization, while treatment with synthetic TGF 
or natural VGF resulted in approximately 90% re- 
epithelialization. The optimal concentration to 
30 promote re-epithelialization of EGF was 1-10 yg/ml, 

while synthetic TGF and natural VGF produced a maximal 
response at 0.1 ug/ml. 



were done to test the effect in wound healing of either 
35 TGF f a modified, truncated form of VGF (VGFa) , or a 

modified, truncated chimeric fusion of TGF and VGF (TTV) , 
all of which were produced by recombinant technology in 



Experiments similar to those described above 
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bacteria. These recombinant growth factors and hybrid 
growth factors accelerated wound healing to the same 
extent as either synthetic TGF or natural VGF, with an 
optimal concentration at 0.1 yg/ml. 
5 2. Mid-dermal Donor Graft Injuries 

Modified truncated VGFa was assayed for its 
ability to accelerate wound healing in a donor graft 
model. The treatment regimen was as described above 
(see Example I) using 1 ml VGFa, 5 yg/ml, in 20 g 
10 Silvadene. Photographs were taken on a daily basis. A 
summary of the results is provided below: 
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Table 

Effect of Recombinant VGFa on Wound Healing 



20 



Treatment* 
Saline 



POD** 7 

very 

open 



Wound Condition 



POD 8 

open 

with 

some 

healing 



POD 9 

mostly 

healed 



POD 10 
healed 



25 



VGFa - 

modified, 

truncated 



open; mostly 
apparent healed 
epitheli- 
alization 



healed healed 



* Silvadene is the vehicle 
** POD = Post Operative Day 



30 



35 



The modified truncated VGFa accelerated heal- 
ing of the wound as compared to the carrier control. 
Photographs (not provided) at POD 8 showed substantial 
differences between saline and VGFa. 
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Example XI. 

Biological Activity of Recombinant Platelet Factor 4 
Prepared in Prokaryotic Cells 

5 

The yield of recombinant Factor 4 prepared 
using different expression cassettes varied from about 
2% to about 20% of total cell protein. These results 
are summarized below: 



Table 



15 Expression of PF4 in Different 

Bacterial Expression Systems 



Percent of 
Total Protein 



pBMll/Ngene/PF4 2 0 % 

20 

pBMll/Ngene/DP/PF4 20% 

pBMll/PAD/PF4 15% 

pBMll/PF4 2% 



25 

A. Inhibition of DNA Synthesis 

The highest activity seen was with the fusion 
protein from the plasmid pBMll/Ngene/PF4 . Fifty 
percent of maximum inhibition of A549 cells was 
30 obtained with 0.67 yg/well. 

B. Inhibition of Growth of Tumors in Nude Mice 

Male nude mice. were injected with platelet 
factor 4 or phosphate buffered saline at 2- to 3-day 
35 intervals , as described in Example 1C. As shown below, 
factor 4 significantly inhibited tumor growth. 
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Table IV 

Effect of Recombinant N-Gene Platelet Factor 
Fusion Protein on 
Growth of Tumors in Nude Mice 



3 



10 



Tumor size (mm 3 ) 



Days Post Treatment 

0 
6 
9 

14 
17 
20 



Control Platelet Factor 4 

17 25 

205 25 

275 25 

420 40 

545 50 

635 85 
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Example XII, 

Biological Activity of Recombinant Oncostatin 
Prepared in Prokaryotic Cells 

20 

A. Physicochemical Characterization of Recombinant 
Oncostain M 
1. SDS-PAGE 

Cultures (50 ml) were grown and induced as de- 

25 scribed. Cultures were pelleted and the cell pellets 
were solubilized in 6 M guanidine HC1. Insoluble pro- 
teins were not removed at this point. Samples for SDS- 
PAGE analysis were dialyzed directly against 1 N acetic 
acid without refolding. 

30 Aliquots consisting of approximately 6 yg of 

total bacterial protein were analyzed by SDS-PAGE on 
10-20% gradient gels (5% stacking gel). Gels were 
stained with Coomassie Brilliant Blue, destained and 
dried. The apparent molecular weight M r of the NDP- 

35 OncM fusion protein was estimated to be 32,000 by com- 
parison of its mobility with that of standard proteins. 
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B. Growth Inhibitory Activity of Recombinant 
Oncostatin M (N-Gene Fusion Protein ) 

Cellular proliferation in the presence of 
recombinant or native Oncostatin M was compared with 
5 proliferation in untreated samples, and expressed as a ^ 
percentage of maximal (untreated) growth. Samples were 
assayed in duplicate or triplicate, and generally 
varied by less than 20% from each other. One growth ^ 
inhibition assay unit of Oncostatin M is defined as the 

10 amount of protein required to cause a 50% inhibition of 
the growth of A375 cells seeded at 3 to 4 x 10 3 during 
a 72-hr assay. Where indicated, concentrations 
required for half-maximal growth inhibition were 
determined by extrapolation from proliferation data 

15 after transformation as follows:* 

1-(A- Qn treated - Aj-QnMaximal) 

% Maximal Inhibition = 100 x -r — 7—3 = r~ 

590 u a ~ -Agggmaximal 

20 

In some cases, a modified proliferation assay 
was used, in whihc the target cell number and serum 
concentration were reduced* Cells were seeded at 500 
cells/well in DMEM containing 5% FBS, treated with 
25 Oncostatin M in the same medium, and incubated at 37 °C 
until untreated cells reached confluence (generally 
between 6 and 10 days, depending on the cell line). 
Monolayers were then stained and processed as described 
below : 

30 ■% 
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Table 

Growth Inhibitory Activity 
of Recombinant Oncostatin M 



Recombinant 1 Oncostatin M 



Native Oncostatin M 



10 



pM 



240 
48 
9.6 
1.9 
0.38 
0.08 



15 



Percent 
Maximal Growth 

12.2 
19.5 
30.4 
70.0 
77.8 
81.5 
86.4 
85.9 
100.0 



pM_ 



160 
32 
6.4 
1.3 
0.26 
0.05 



Percent 
Maximal Growth 

15.1 
22.2 
41.5 
67.3 
82.3 
76.0 
72.4 
77.5 
100.0 



'Prepared using pBMll/NDP/OncoM expression cassette. 
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25 



30 



35 



C. Receptor Binding Activity of Recombinant 
Oncostatin M (N-Gene Fusion Protein ) 

A 54g human lung carcinoma cells were incubated 
with l2 . s I-Oncostatin M in the presence of increasing 
amounts of unlabeled Oncostatin M. A fraction (-3) of 
added "si-Oncostatin M bound to these cells in the 
absence of unlabeled Oncostatin M. When binding was 
measured in the presence of unlabeled native or 
recombinant Oncostatin M, a concentration-dependent 
inhibition of binding of l2 si-Oncostatin M was observed 
(half-maximal effect at -300 pM) . Total binding of 
1251-oncostatin M was inhibited by approximately 90% at 
the highest concentration of unlabeled Oncostatin M 
tested. Native and recombinant Oncostatin M did not 
differ significantly from each other in their abilities 
to inhibit binding of i 25 I-Oncostatin M. The results 
are as shown below: 
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Table 

Receptor Binding Activity 
of Recombinant OncostatinM 



Recombinant 1 OncostatinM Native Oncostatin M 



Percent Percent 

10 pM Maximal Growth pM Maximal Growth 

16,000 14 27,000 12 

3,200 27 5,400 20 

640 48 1,080 40 

128 72 216 67 

26 80 43.2 81 

iq 5.1 92 8.6 88 

■ 1 90 1.7 89 

0 100 0 100 



^Prepared using pBMll/NDP/OncoM expression cassette. 
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The compositions of the subject invention 
comprise expression casettes for the efficient 
expression of polypeptides in prokaryotic cells. The 
expression cassettes find use in production of large 
amounts of polypeptides by providing for increased 
stability of the expression products as well as for 
obtaining mature folded polypeptides secreted into the 
growth medium of the host cell. 

All publications and patent applications 
mentioned in this specification are indicative of the 
level of skill of those skilled in the art to which 
this invention pertains. All publications and patent 
applications are herein incorporated by reference to 
the same extent as if each individual publication or 
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patent application was specifically and individually 
indicated to be incorporated by reference. 

The invention now being fully described , it 
5 will be apparent to one of ordinary skill in the art 

that many changes and modifications can be made thereto 
without departing from the spirit or scope of the 
appended claims* 
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WHAT IS CLAIMED IS; 



10 



15 



1. An expression cassette comprising: 
P — S • D . — met — G^ 

wherein 

P comprises a promoter sequence; 

S.D. comprises a Shine-Dalgar no sequence; 

met comprises a codon for an initiating 

methionine ; 

comprises a first DNA sequence 
encoding a polypeptide of interest , wherein the codons 
for the N-terminal amino acids are modified using codon 
degeneracy so that the nucleotides encoding said 
polypeptide of interest approximate those of the native 
nucleotide sequence associated with said S.D. 



20 



2. The expression cassette according to 
Claim 1, wherein said S.D. is the N-gene Shine-Dalgarno 
sequence. 



25 



30 



35 



3. The expression cassette according to 
Claim 2, wherein said S.D. and said met are separated 
by from about 5 to 9 nucleotides. 



4. 



wherein 



An expression cassette comprising: 
P — S.D. — met — L — G 

P comprises a promoter sequence; 

S.D. comprises a Shine-Dalgarno sequence; 

met comprises a codon for an initiating 



methionine; 



L comprises a first DNA sequence encoding 
a leader sequence wherein the codons for the amino 
acids are modified using codon degeneracy so that the 
nucleotides encoding said leader sequence approximate 
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those of the native nucleotide sequence associated with 
said S.D.; and 

G comprises a second DNA sequence 
encoding a polypeptide of interest. 

5 

5. The expression cassette according to 
Claim 4, wherein the first about 7 to 30 codons of said 
first DNA sequence comprise modified codons. 

10 6. The expression cassette according to 

Claim 5, wherein said S.D. comprises the N-gene Shine- 
Dalgarno sequence and said leader sequence comprises an 
alkaline phosphatase signal sequence. 

15 7. The expression cassette according to 

Claim 4, whereih said leader sequence is a an N- 
terminal amino acid sequence from a highly expressed 
gene; a hydrophobic amino acid sequence; or a 
hydrophilic amino acid sequence. 

20 

8. The expression cassette according to 
Claim 7, wherein said highly expressed gene is the 
bacteriophage lambda N-protein gene or Cro gene, or the 
bacterial beta-galactosidase gene. 

25 

9. The expression cassette according to 
Claim 7r wherein said hydrophobic amino acid sequence 
is the bacterial alkaline phosphatase signal sequence. 

30 10. The expression cassette according to 

Claim 7 , wherein said hydrophilic amino acid sequence 
is about 41 N-terminal amino acids from amphiregulin. 



35 



11. An expression cassette comprising: 
P — S.D. — met — L — G 

wherein 
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P comprises a bacteriophage lambda P^ 
promoter; a bacterial lac promoter; or a bacterial trp- 
lac fusion promoter. 

S.D. comprises a Shine-Dalgarno sequence; 
5 met comprises a codon for, an initiating 

methionine; 

L comprises a first DNA sequence encoding 
a leader sequence; and 

G comprises a second DNA sequence 
10 encoding a polypeptide of interest. 

12. An expression cassette according to Claim 
11 , wherein said promoter further comprises a 
regulatory sequence. 



15 



20 



13. An expression cassette according to Claim 
12 , wherein said regulatory sequence comprises a 
bacteriophage lambda operator; or a bacterial lac 
operator. 

14. An expression cassette according to Claim 
11/ wherein said S.D. comprises an N-gene or a Cro gene 
ribosomal binding site. 



25 15. An expression cassette according to Claim 

11 r wherein said leader sequence comprises about 8 to 
about 35 N-terminal amino acids from a bacteriophage 
lambda N-gene or Cro gene; or a bacterial alkaline 
phosphatase gene. 

30 

16. An expression cassette according to Claim 
11 , further comprising at least one codon joined in 
reading frame between said first DNA sequence and said 
second DNA sequence. 



35 
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17. An expression cassette according to Claim 

16 , wherein said codons encode a chemical cleavage site 
or an enzymatic cleavage site. 

5 18. An expression cassette according to Claim 

17, wherein said chemical cleavage site is aspartic 
acid-proline. 

J 

19. An expression cassette comprising: 
10 P — S.D. — met — L — G 

wherein 

P comprises a promoter having a -35 and a 
-10 regulatory sequence; 

S.D. comprises a Shine-Dalgarno sequence; 
15 met comprises a codon for an initiating 

methionine; 

L comprises a first DNA sequence encoding 
an alkaline phosphatase signal sequence, wherein the 
codons for the amino acids of said signal sequence 
20 approximate those of the native nucleotide sequence 
associated with said S.D.; and 

G comprises a second DNA sequence 
encoding a polypeptide of interest. 

25 20. The expression cassette according to 

Claim 19, wherein said -35 and said -10 regulatory 
sequences are substantially the same as those from the 
bacterial lac promoter. 

30 21. The expression cassette according to 

Claim 20, wherein said regulatory sequences are from 
the bacterial lac promoter. 

22. The expression cassette according to 
35 Claim 19, wherein said -35 regulatory sequence is 

substantially the. same as the -35 regulatory sequence 
from the trp promoter and the -10 regulatory sequence 
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is substantially the same as the -10 regulatory 
sequence from the lac promoter. 

23. The expression cassette according to 

5 Claim 22, wherein said -35 regulatory sequence is from 
the trp promoter and said -10 regulatory sequence is 
from the lac promoter. 

24. The expression cassette according to any 
10 one of Claims 21 and 23 wherein said S.D. comprises the 

N-gene Shine-Dalgarno sequence. 

25. The expression cassette according to 
Claim 23 f wherein said S.D. comprises the Cro gene 

15 Shine-Dalgarno sequence. 

26. The expression cassette according to 
Claim 25, wherein said S.D. is separated from said met 
by about 11 or 12 nucleotides. 

20 

27. The expression cassette according to 
Claim 24, wherein said S.D. is separated from said met 

- by about 7 or 8 nucleotides. 

25 28. An expression cassette comprising: 

P — S . D . — met — L — G — T 

wherein 

P comprises a promoter having a -35 
regulatory sequence from the trp promoter and a -10 
30 consensus regulatory sequence from the lac promoter; 

S.D. comprises a Cro gene Shine-Dalgarno 

sequence; 

met comprises a codon for an initiating 

methionine; 

35 L comprises a first DNA sequence encoding 

an alkaline phosphatase signal sequence, wherein the 
codons for the amino acids are modified using codon 
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10 



degeneracy so that the nucleotides encoding said signal 
sequence approximate those of the native nucleotide 
sequence associated with an N-gene Shine-Dalgarno 
sequence; 

G comprises a second DNA sequence 
encoding a polypeptide of interest; and 

T comprises a transcriptional termination 
region substantially similar to the gene 32 
transcriptional termination region. 



29. An expression cassette comprising: 

P — S.D. — met — L — G — T 

wherein 

P comprises a promoter, having a -35 
15 regulatory sequence from the trp promoter and a -10 
regulatory sequence from the lac promoter; 

S.D. comprises an N-gene Shine-Dalgarno 

sequence; 

met comprises a codon for an initiating 

20 methionine; 

L comprises a first DNA sequence encoding 
an alkaline phosphatase signal sequence, wherein the 
codons for the amino acids are modified using codon 
degeneracy so that the nucleotides encoding said signal 
25 sequence approximate those of the native nucleotide 
sequence associated with said S.D.; 

G comprises a second DNA sequence 
encoding a polypeptide of interest; and 

T comprises a transcriptional termination 
30 region substantially similar to the transcriptional 
termination region of gene 32. 

30. An expression cassette comprising: 

P — S . D . — met — L — G — T 

35 wherein 

P comprises a promoter having a -35 
regulatory sequence and a -10 consensus regulatory 
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sequence from the lac promoter; 

- S.D. comprises a N-gene Shine-Dalgarno 

sequence; 

met comprises a codon for an initiating 

5 methionine; 

L comprises a first DNA sequence encoding 
an alkaline phosphatase signal sequence , wherein the 
codons for the amino acids are modified using codon 
degeneracy so that the nucleotides encoding said signal 
10 sequence approximate those of the native nucleotide 
sequence associated with said S.D.; 

G comprises a second DNA sequence 
encoding a polypeptide of interest; and 

T comprises a transcriptional termination 
15 region substantially similar to the transcriptional 
termination region of gene 32. 

31. * An expression cassette comprising: 
P— S . D . — met— L — G — T 

20 wherein 

P comprises a promoter having a -35 
consensus regulatory sequence from the trp promoter and 
a -10 consensus regulatory sequence from the lac 
promoter; 

25 S.D. comprises a Cro gene Shine-Dalgarno 

sequence; 

met comprises a codon for an initiating" 

methionine; 

L comprises a first DNA sequence encoding 
30 an alkaline phosphatase signal sequence , wherein the 
codons for the amino acids are modified using codon 
degeneracy so that the nucleotides encoding said signal 
sequence approximate those of the native nucleotide 
sequence associated with said the N-gene Shine-Dalgarno 
35 sequence; 

G comprises a second DNA sequence 
encoding a polypeptide of interest; and 
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T comprises a transcriptional termination 
region from gene 32. 

32 • A prokaryotic cell comprising an 
5 expression cassette according to Claims 28 to 31. 

33- A cell according to Claim 32 , wherein 
said cell is an E. coli cell. 

10 34. A fusion protein comprising a polypeptide 

of interest and about 8 to about 35- N-terminal amino 
acids from a bacteriophage lambda N-protein or Cro 
protein or a modified bacterial alkaline phosphatase 
signal sequence joined to the N-terminus of said 

15 polypeptide of interest. 

35. A fusion protein according to Claim 34 
further comprising a central region of at least one 
amino acid between' said polypeptide of interest and 
20 said N-terminal amino acids. 



25 



36. A fusion protein according to Claim 35/ 
wherein said central region comprises a chemical 
cleavage site or an enzymatic cleavage site. 
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